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PSYCHOLOGICAL REVIEW 


EDITORIAL 
Richard Solomon’s distinguished ога theoretical development. Reviews 


editorship of the Review is being com- 
pleted in the present issue." However, 
the processing of manuscripts was 
transferred to the new Editor in Janu- 
ary 1964, On the basis of a year's 
experience with manuscripts submitted 
for publication in the Revicw, certain 
comments seem to be in order concern- 
ing the policy as to what kinds of 
articles are appropriate for the Review. 
The Review is the APA journal соп- 
cerned with psychological theory and 
its development, The guiding princi- 
ple used by the Editor and his con- 
Sultants in evaluating manuscripts for 
their appropriateness to the Review i 
that the major theme of an article must 
be a theoretical one (in any area of 
Le ng Many manuscripts re- 
femel = the past year have been re- 
da ace : their authors because they 
бнаа y^ = criterion of appropri- 
several А uch manuscripts fall into 
Classes, 

ao CE 
topic or ht on some 
for the Review, The, not appropriate 
of the Psycholo Lr the province 
theoretical article | edt Buletin: Ж 
be supported b a of course, often 
ture. The V wipes Ө ше [Пе 
article, however, if eature of such an 
priate for the Re: И it is to be appro- 
Vie, must be a theory 


1 

i s papers by Minard, Bahrick, Moltz, 

pro eterson, published in this issue, Were 
cessed by Richard Solomon. 


of the literature may lead to rejection 
of or the discovery of critical weak- 
nesses in a theoretical position, but in 
the preparation of such papers the 
theoretical question should be pard- 
mount and the statement of a theoreti- 
cal development or alternative should 
receive careful and detailed attention. 

2. Statistical Articles. Papers on 
statistics and problems of experimental 
design are again the province of the 
Bulletin. (It should be noted that the 


Bulletin. encourages interpretive arti- 
cles. A Bulletin article should serve 
as a bridge between statistical contribu- 
tions per se and the typical research 
psychologist.) 
3i Methodological Articles. These, 
too, belong to the Bulletin, unless a 
flaw in method has led to theoretical 
misconceptions of major degree. Pa- 
cerned with the use of control 
fic pieces of apparatus, 
f methods, and the like 
e for the Review. í 
iical Studies. Reports 0 

Е: жр other kinds of studies 
experimen oe for the 
as such hy- 
Review, even t 


Пу not appropri 
epe burden of the paper 15 2 


theoretical development. 
honored practice for em 
to publish studies design 
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potheses, and this is true no matter 
whether the outcome is favorable or 
unfavorable to the hypothesis or deduc- 
tion. А 

Empirical work тау be appropri- 
ately included in an article for the 
Review, but only when the report of 
empirical material occurs in the context 
of theoretical matters. The major 
burden of the paper must be theoreti- 
cal, with the empirical investigation in 
a supporting role. The report of the 
study or experiment must be suffi- 
ciently detailed that it can be evaluated 
by the criteria used in empirical 
journals. 

5. Papers Expressing an Idea, a 
Definition or a Concept. A number of 
papers have been received whose con- 
tent is “thin.” They may include an 
idea, or concept, or perhaps a definition, 
but do not reflect a detailed working 
out of the theoretical aspect or implica- 
tions. Such Papers do not meet the 
standards of a Psychological Review 
article. Such Papers, however, if brief, 
may be publishable in the Theoretical 
Notes section, which will continue un- 
der the new Editor, and should be 
Prepared with this goal 

Papers Prepared for Other Pur- 
poses, Speeches, Symposium presen- 
tations, technical reports, and term 


N. Corer 


papers have been submitted for publi- 
cation in unaltered and unedited form. 
While such papers may be good ones, 
their form suggests that the authors 
have not considered very seriously the 
nature of the journal and of its audi- 
ence to which they have sent the papers. 
Papers of this kind are seldom pub- 
lishable in the Review as they are sub- 
mitted and may be returned to the au- 
thors summarily without editorial 
review. All manuscripts submitted 
must conform to the usages of the 
APA Publication Manual. 

There are, of course, papers whose 
appropriateness for the Review is dif- 
ficult to decide on the basis of the 
guidelines just indicated. Authors, in 
Such cases, are free to send their manu- 
Scripts to the journal they deem to be 
appropriate, — Tt usually serves no 
Purpose to write to the Editor for an 
opinion of the appropriateness of a 
paper described in a general way. 

The new Editor has continued and 
will continue the practice of using а 
large number of consultants, each one 
only on occasion, to assist him in the 
task of evaluating manuscripts. То 
those who have already served and to 
those who will serve in the future, he 


extends grateful acknowledgment of 
their assistance, 
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A MODEL OF 


EBERHARD ZWICKER 


LOUDNESS 


SUMMATION' 


BERTRAM SCHARF 


AND 


Technische Hochschule Stuttgart 


Northeastern University 


A psychophysical model is presented that explains why loudness 
summates across frequency as it does and that permits the precise 


calculation of loudness from the physical spectrum. 


Loudness is 


represented by geometrical patterns derived from the masking of 


pure tones by narrow bands of noise. 


The masking patterns are 


converted to loudness patterns by means of the critical-band function 
that relates tonalness in Barks to frequency in cycles per second and 
a power function that relates specific loudness, loudness per Bark, to 
sound pressure level (SPL). Plotted on the coordinates of specific 


loudness and tonalness, the geome 


trical patterns are integrated to 


yield a value in sonese for the overall loudness. Calculated values are 
compared to experimental values obtained from loudness balances with 


3 types of sound. 


Loudness summates across fre- 
quency. Thus two pure tones are 
usually louder than either alone. Their 
Overall loudness is seldom, however, 
the sum of their individual loudnesses. 
The failure of loudness to summate 
Perfectly is ascribed to mutual inhibi- 
tion between the tones (cf. Stevens, 
1956). Since the degree of inhibition 
depends upon both the intensity of the 
tones and their frequency separation, 
no simple rule can serve to predict 
their total loudness directly from their 
Physical properties. Nevertheless, it 
1$ possible to represent each of the two 
ae or indeed any number of tones in- 
к: е infinite number contained 
п а continuous spectrum, by geometri- 
cal patterns that permit a precise analy- 
Fed of loudness summation. These pat- 
Ы ns аге the essential feature of the 
Model of loudness summation presented 
In this paper. 


1 ч 
is Preparation of this paper was supported 
Seer by United States Public Health 
atio eh Grant NB 04464 from the 
and B Institute of Neurological Diseases 
| indness. The authors wish to thank 
Se ета for helpful discussions and sug- 
5 concerning the paper. 


Although often described below in 
broad physiological terms, the model is 
based entirely on psychophysical meas- 
ures of masking, loudness, and the 
critical band, and is therefore wholly 
psychological. The scales of "specific 
loudness” and “tonalness,” which re- 
place the physical scales of intensity 
and frequency, provide the coordinates 
for the geometrical representation of 
loudness. The specific loudness scale, 
unlike the sone scale, is a derived scale 
and represents the loudness produced 
over each unit of tonalness. The scale 
of tonalness is based on the critical 
band as measured in experiments on 
the threshold for complex sounds, two- 
tone masking, phase sensitivity, and 
loudness summation itself (F eldtkeller, 
1955; Scharf, 1961b). 

The use of a geometrical representa- 
tion of loudness follows from the place 
theory of hearing which assumes a 
local pattern of excitation produced 
within the auditory nervous system by 
a pure tone or by each pure tone com- 
ponent of a complex sound (a sound 
composed of two or more frequency 
components). In the present model, 
the shape and level of the local patterns 
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are inferred from the masking of pure 
tones by narrow bands of noise. A 
complex sound is represented by a 
number of these local patterns whose 
integrated area yields a measure of 
the total loudness. : 

Some of the basic notions underlying 
the model such as the conversion of 
the intensity and frequency scales to 
psychological scales and the derivation 
of loudness patterns from masking pat- 
terns are not new (Fletcher & Munson, 
1937; Harris, 1959; Howes, 1950; 
Munson & Gardner, 1950). New are 
the concept of the specific loudness and 
the use of the critical band and also 
the application of the model to a num- 
ber of different types of sound. 

Parts of the model have already 
been described elsewhere (Scharf, 
1964; Zwicker, 1958, 1963). The 
whole model is here considered and 
tested against a variety of empirical 
measurements of loudness. As will be 
seen, the model, based on a few simple 
and general rules, accounts for most 
of the complex data on loudness sum- 
mation. 

Before introducing the model and 
testing it against experimental meas- 


ures, we review briefly the basic facts 
of loudness summation, 


Basic Facts or LOUDNESS 
SUMMATION 


Perhaps the most striking fact of 
loudness summation, defined as the in- 
crease in loudness with the spread of 
energy over frequency, is that it does 
not begin until the energy is spread 
over more than a critical bandwidth 
(Frequenzgruppe) (Zwicker & Feldt- 
keller, 1955; Zwicker, Flottorp, & 
Stevens, 1957). Up to the critical 
band, the loudness of a complex sound 
is largely independent of the width of 
energy spread. Loudness first and 
rather abruptly begins to increase with 


bandwidth at the critical band. The 
value of a critical band centered on a 
given frequency is the same at all in- 
tensities, but beyond it the amount of 
the increase in loudness depends upon 
the intensity of the sound (Zwicker & 
Feldtkeller, 1955; Zwicker, Flottorp, & 
Stevens, 1957). Indeed, close to 
threshold е loudness decreases 
slightly rather than increases (Scharf, 
19592). Ағ sensation levels above 
10 decibels, however, some increase in 
loudness with bandwidth may be noted, 
with the greatest increases at moderate 
sensation levels between 40 and 60 
decibels. 

These rules hold regardless of where 
in the frequency spectrum the sound 
is located (Zwicker & Feldtkeller, 
1955; Scharf, 19593). They also hold 
whether the threshold is normal or is 
raised to an abnormally high level by 
the presence of a moderately intense 
uniform masking noise (Scharf, 1961a) 
or by a middle-ear impairment (Scharf, 
19625). Тһе complex sound itself 
may be composed of any number of 
components from two up to the infinite 
number comprising a band of white 
noise (Scharf, 1959b), А complex 


sound at a given sensation level is loud- 


est, however, when the components are 


all equally loud (Scharf, 1962a) and 
when they are Separated by an equal 
number of critical bands (Zwicker, 
Flottorp, & Stevens, 19575, 
The relation between loudness and 
bandwidth is the Same even for sounds 
of short duration, although the loud- 
ness of a complex sound decreases as 
15 duration is decreased below 70 
milliseconds (Port, 1963a, 1963b). 


DESCRIPTION AND DEVELOPMENT 
OF THE Moprr 


Although the experimental measures 
Ё loudness Summation yield a com- 
plex pattern of results, a single model 
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la. Fi " 
| of E d diagram is the physical spectrum 
| against PO Tps tone. (Intensity I plotted 
mplitude doe f. Next is plotted the 


A of the displacement caused by 
e placem 

ae Ti the basilar membrane as a func- 
Third is istance ] from the helicotrema. 
Tgan of En the Excitation E at the 
[The symb, on as a function of tonalness 2. 
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а function wn the Specific Loudness N’ as 
the Toud of Tonalness z. The area under 
total Jou, ness pattern corresponds to the 

ness of the tone.) 
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сап епсо 


Outline ™Mpass them all. First let us 


a ee the model as it is applied 
hen 5: кч tone and to white noise. 
Velopm, shall review the detailed de- 
i eg of the model. 
th ni l traces the transformations 
15 g rou eet to occur when the ear 
and h ated by a pure tone (Fig. 1а) 
Phys White noise (Fig. 1b). The 
measurement of an ideal pure- 
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1b. The spectrum ofa white noise followed 
by the displacement, excitation, and loudness 
patterns produced by the noise (Zwicker, 
1958, adapted with permission of Acustica). 


a loudness pattern. 


tone stimulus at the ear drum yields 7 
line spectrum that shows all the E 

energy concentrated at a single n 
quency. Beyond the ear drum, the 
tonal intensity and frequency are trans- 
mitted by the ossicles of the middle ear 
to the oval window. At that point 


the inner-ear fuid and consequently 


the basilar membrane are set 1n mo- 
tion Békésy's measures of the vibra- 
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tion of the basilar membrane permit us 
to plot the maximum amplitude A of 
the displacement produced by the tone 
over the length of the basilar mem- 
brane, as shown in the second part of 
Figure la. This first, important step 
in the development of a model for loud- 
ness summation shows that energy 
confined to a single point on the physi- 
cal frequency scale produces a displace- 
ment over a wide area of the basilar 
membrane. The displacement of the 
basilar membrane somehow activates 
the sensory cells whose neural response 
is then translated through several 
synaptic junctures to higher levels of 
the nervous system. The wide lateral 
displacement on the basilar membrane 
is apparently greatly reduced at the 
neural level, and the pattern of Excita- 
tion E looks like that in the third part 
of the figure. (The excitation pattern 
is based primarily upon masking data, 
as explained below.) 

The value of the excitation relative 
to an arbitrary reference E, is plotted 
here against the scale of tonalness (z) 
which is related in an approximately 
linear fashion to distance on the basilar 
membrane. The excitation pattern is 
then transformed into a loudness pat- 
tern, depicted in the fourth part of 
Figure la, by means of a special equa- 
tion relating Specific Loudness N’ to 
Excitation E. The total loudness pro- 
duced by the original tone is the in- 
tegral of the area under the specific 
loudness pattern. 


Thus the first part of the figure 
comes from a purely physical measure. 
The second part, the displacement over 
the basilar membrane, comes from 
physiological measures. The last two 
parts of the figure may, in the next 
years or decades, also be measured 
physiologically, but in the meantime 
they are derived from psychophysical 
measures to form the basis of our 


model of loudness and loudness sum- 
mation. 

Now let us look at the same se- 
quence for white noise. The energy in 
white noise is distributed over fre- 
quency as shown in the first part of 
Figure 1b. The displacement pro- 
duced along the basilar membrane is 
approximated from measurements on 
models of the basilar membrane 
(Oetinger & Haüser, 1961). The ex- 
citation and loudness patterns for the 
noise are quite different from the pat- 
terns produced by a pure tone. 

Whereas the first two diagrams of 
Figure 1 are well-known physical and 
physiological measures, the representa- 
tions of excitation and specific loud- 
ness are new and require explanation. 
The explanation may be broken down 
into five parts to which a sixth is 


added for sounds heard against a back- 
ground of noise. 


l. The source of the excitation 
measures 

2. The relation between the tonal- 
ness scale and the physical fre- 
quency scale 

3. The function relating the spe- 
cific loudness to excitation 

4. The dependence of the trans- 
mission characteristics of the mid- 
dle ear on frequency 


5. The generation of the total 
loudness 


6. The effect of partial masking 
on loudness 


1. It is assumed that both the exci- 
tation and the loudness patterns gen- 
erated in the nervous system upon 
acoustic stimulation can be derived 
from the masking patterns, which can 
be empirically measured. This as- 
sumption is similar to that originally 
set forth by Fletcher and Munson 
(1937) and adopted by Steinberg and 
Gardner (1937) and also by Harris 
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(1959). The masking pattern for a 
given sound is obtained by using the 
sound to mask a pure tone. The plot 
of the masked threshold for the tone, 
asa function of its frequency, is the 
masking pattern. 

Ns ы to avoid the necessity of 
ea ring the masked thresholds with 
icu sound to which the model is ap- 
Mic a ў of standard masking pat- 
sedia: their derived excitation 
sm ы, been made available for 
1963) oe of noise (Zwicker, 1958, 
"o hese narrow-band patterns 
зы ы: the masking patterns of any 
к ишмар than or equal to а 
sinse th and, including pure tones, 
tiall is masking patterns are essen- 
Шаш he same for all bandwidths less 
IW — band (Ehmer, 1959; 
nn к. 956). Combinations of pat- 
E ie ntered at different frequencies 
Mu the masking patterns of 
i wider than a critical band. A 
Plate UN patterns is shown in 
ja n or the masking of pure tones 
1200 rrow band of noise centered at 
racc cps. The parameter on the 
(SPL) | the sound pressure level 

"he K the masking noise. 
Wes e E in the transformation 
nigri patterns to excitation 
ing tn the conversion of the mask- 
bag at each frequency to the cor- 
Чо sua E excitation level. But how 
masked лнн from the measured 
within t reshold the excitation level 
k a nervous system produced by 
nnd E ing sound? The answer 
noise 1 zx data on the self-masking of 
intensit » from measures of the jnd for 
excitation ees in such measures the 
г dnd d terns of the masked noise, 
most ident masking noise, I, are al- 
tien ie their ratio, А//7, may 
the iud © express the ratio between 
M ard ation of the just masked stimu- 
that of the masking stimulus. 
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Fic. 2. Masking effect of a narrow band 
of noise with a center frequency of 1,200 cps. 
The parameter is the effective SPL of the 
noise. (On the ordinate the masked thres- 
hold for a pure tone is plotted for each level 
of noise as 2 function of frequency [Zwicker, 
1958, adapted with permission of Acustica.) 


Fortunately over a wide range of in- 
tensities the ratio, AI/I, is fairly con- 
stant, being equal to 3 for the lower 
and middle frequencies and to } for 
the higher frequencies (Zwicker, 
1956). The minimum excitation re- 
uired then to mask completely at a 
given frequency is twice (3 decibels) 
or four times (6 decibels) the intensity 
at the masked threshold. The addition 
of 3 (or 6) decibels to the masked 
threshold gives the value for the excita- 
tion level. Excitation level will be 
expressed in decibels as Lg — 10 log 
E/Eo where Eo is a reference value 
corresponding to Јо = 10% watt/ 

centimeters”. . 
2. The representation of the excita- 
tion pattern requires not only the con- 
of the ordinate from masking 


level to excitation level, but also the 
i { the abscissa from the 


physical frequency scale to one that 


corresponds more closely to the way m 
which activity spreads in the auditory 
system. good correspondence ap- 
pears to be achieved by the tonalness or 
The spread of activity тер- 


z scale. 
resented by the 2 scale probably corre- 


sponds to 4 spatial spread of activity 
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Fic. 3. Relation between tonalness in 
Barks and frequency (Zwicker, 1961, adapted 


with permission of Journal of the Acoustical 
Society of America). 


on the basilar membrane, but at higher 
neural levels, the spread is not neces- 
sarily spatial) The relation between 
tonalness and frequency is given by the 
function in Figure 3 (Zwicker, 1961). 
The unit, Bark (after the German 
acoustician Barkhausen), replaces the 
critical band in order to distinguish 
the Bark as a measure of sensory 
events from the critical band, a stimu- 
lus measure. The z function seems to 
be an appropriate measure of auditory 
activity because of its crucial role as 
the unit of integration in loudness sum- 
mation, threshold measurements, 
masking (Greenwood, 1961; Schart, 
1961b). Moreover, a similar function 
relates the jnd of pitch, the fre 
representation on the basilar 
brane, and the mel sc 
frequency. 

With the aid of the tonalness func- 
tion, the masking patterns of Figure 2 
have been converted to excitation pat- 
terns in Figure4. (Excitation patterns 
for narrow bands of noise centered at 
lower and higher frequencies are simi- 
lar to those in Figure 4; see Zwicker, 
1958.) The excitation patterns are for 


and 


quency 
mem- 
ale of pitch to 


a band of noise no wider than a critical 
band, centered at 1,200 cps and having 
the SPL shown as the parameter on 
the curve. These curves also repre- 
sent the excitation produced by a 1,200- 
cps tone, since the masking patterns 
produced by pure tones and by narrow 
bands of noise are almost the same 
(except where beats and audible non- 
linear distortions occur). 

3. In order to use these excitation 
patterns for the derivation of loudness, 
it is necessary to know the relation 
between loudness and excitation or 
more exactly, between Specific Loud- 
ness N’ and Excitation E. A psycho- 
physical equation expressing this rela- 
tion has been formulated (Zwicker, 
1958, 1963). Tt is based upon Stevens’ 
power law, which says in one form that 
equal intensity ratios yield equal loud- 
ness ratios (Stevens, 1957) ; it is also 
based upon the assumption that the 
loudness of any sound is the integral of 
the specific loudness over the z scale. 
In this sense every sound involves the 
summation of loudness, for Figure 4 
shows that the excitation aroused by 
even a pure tone spreads over a con- 
siderable portion of the z scale, The 
next few paragraphs trace the deriva- 


моа 


2°. € ae 


Tonainess in Barks 


ж з 3 2 X 


Mic. 4 Excitation patterns calculated 
from the m 
band of noi 
noise is parameter on the curves, 
Zwicker, 1958; adapted with permission of 
Acustica.) 
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tion of the basic formula that relates 
specific loudness to excitation. 
Stevens’ law may be expressed as: 


AN _ y 

N 1 

where I is the intensity of a tone and 
N is the loudness. In terms of excita- 
tion, we must assume that this law ap- 
Plies only to the excitation over a 
Single Bark. It is not feasible to ap- 
Ply this law to the whole excitation 
Гав by a pure tone since the exci- 
lee pattern changes considerably 
T d with frequency and intensity. 
Б critical band or Bark provides a 
Stant unit over which the excitation 
кв be converted to loudness by means 
tö eagle equation. The loudness due 
allea excitation over a single Bark is 
cues pis specific loudness," repre- 
tem y the symbol №. With In- 
; Sity I replaced by Excitation E, 

quation 1 becomes: | 


[1] 


[2] 


erin gat threshold, where intensity dis- 
Gear is poor, the relationship 
to the Fie unless a constant is added 
His es enominators of Equation 2. 
freres ar E, may be thought to 

е ear | the excitation produced in 
ed by an inaudible physiological 
Suppress НЄ ness, This excitation can 
an exte а weak excitation produced by 
lower we stimulus thereby setting à 
he see the absolute threshold, for 
ing į S Sensitivity. The correspond- 
» Naudible specific “loudness” is 


für. 
C AN 
h AP oe N EN 
reati Р . 
апана Equation 3 as a differential 
9n and integrating, we have 


[3] 


log (Ns. + N’) 
= k log (Е, + E) + log Cor [4] 
М' + № = C (Ey + E). [ај] 


Taking as our boundary conditions N’ 
= 0 when ЕЁ = 0, we obtain for the 
constant of Integration C: 


or [5] 


N'a = C (Ej). [Sa] 


The evaluation of the constant Egr 
depends upon the same assumption 
used to convert from masking to exci- 
tation patterns, namely, that the mask- 
ing excitation must be twice or four 
times the excitation produced by the 
just masked tone. It is assumed that 
the internal background excitation is 
twice the excitation E, produced by an 
external tone at the absolute threshold 
(or four times if the tone is at a high 


frequency). 
Ej 2E, [6] 


Using Equations 5 and 6 to substi- 
tute in Equation 4 we arrive at: 


2] =) om 


In order to express the value N',, in 
relative values of Excitation E,/E,, we 
introduce the reference value № ‘are 
Using Equation 5a with 2E, replacing 
EQ, We obtain: 


[8] 


Equation 7 may now be written : 


x DES + jy - 1] [9] 
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Fic. 5. Calculated and observed (dot- 
dash line) difference in SPL between a 
1,000-cps tone and a uniform masking Noise. 
(Calculations were made with three different 
values of k inserted in Equation 9 [Zwicker, 
1958; adapted with permission of Acustica].) 


N'or, is an arbitrary constant that de- 
termines the units in which the specific 
loudness is measured; E, is the excita- 
tion corresponding to Г, taken as 10-18 
watt/centimeters?; E is the excitation, 
at a given point on the z Scale, evoked 
by the stimulating tone. 

The exponent b remains to be de- 
termined. 16 importance becomes 
clear at large values of E where Equa- 
tion 9 approaches the simpler relation 
№ ~ (ЕЈЕ,)к The value of k can 
not, however, be determined from 
Equation 9 by the direct measurement 
of N’, the loudness arising from excita- 
tion restricted to a single Bark. Such 
a restricted excitation pattern can not 
be produced in the ear. This difficulty 
has been bypassed, and the value of the 
exponent calculated from measure- 
ments of the loudness of two very dif- 
ferent sounds (Zwicker, 1958). One 
sound was a 1,000-cps tone and the 
other a uniform stimulating Noise, a 
noise that has the same overall inten- 
sity in each critical band. The two 
sounds were matched for loudness at 
various levels. The observed differ- 
ences in loudness level between the 
noise and tone are plotted as the dot- 


dash line in Figure 5, On the same 


plot are shown the differences calcu- 


lated from Equation 9 with 3 different 
values of k. In these calculations the 
"Total Loudness N of the tone and also 
of the noise is the integral of the spe- 
cific loudness over the z scale, that is 


2224 
N- Í N'ds. [10] 
КеП 


The best fit between the observed and 
calculated differences is obtained with 
k = 0.23. 

All the unknown constants of Equa- 
tion 9 have now been determined ex- 
cept the arbitrary constant N'or,- This 
constant is chosen so that the integral, 
SN'dz, of the specific loudness of a 
1,000-cps tone at 40-decibel SPL is 
equal to 1 sone. The unit assigned to 
the specific loudness is soneg, where 
the subscript G distinguishes this cal- 
culated value from the sone obtained 
by direct loudness measurements, 

After some smoothing at low levels, 


we arrive at the final equation for the 
specific loudness, 


" ESS 
N' —0.08 (2) 


Е 23 
Ж [c E +1/2) -1| 

in soneg/Bark [11] 

This equation is graphed in Figure 
6 with both E, and Е expressed in 
decibels as І, and Ly. Given the exci- 
tation level at threshold and given the 
excitation level due to the stimulating 
tone, the specific loudness, N’ in 
Sonesg, may be found on the graph at 
each Bark under the excitation pattern. 
4. The ordinate of Figure 4 should 
but Ly—a. The 
error introduced 
asked thresholds 


ә not only because 
Я Produced within 
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because more energy is transmitted by 
the middle ear to the receptor cells at 
some frequencies than at others. The 
value of а is in a sense the amount of 
Stimulus energy lost (or gained, de- 
pending on the reference value chosen) 
aS a given tone is transmitted to the 
oval window. The masked threshold 
for the tone would then be greater by 
a decibels and would yield a value for 
the excitation level Lr of the masking 
stimulus that would be a decibels too 
high. 

Direct measures of the transmission 
Characteristics of the middle ear are 
apparently not available, but they may 
be approximated if we assume that the 
transmission characteristics govern the 
dependence of the absolute threshold 
on frequency (at least above 2,000 
cps). Such an assumption implies 
that neural sensitivity is constant over 
most of the audible frequency scale (up 
to 9,000 or 10,000 cps) and that, for 
example, the average threshold is 18 
decibels lower at 4,000 cps than at 
8,000 cps (Robinson, 1957) because 
the energy transmitted to the oval win- 
dow is 18 decibels less at 8,000 cps. 
The value of a would then be 18 
decibels higher at 8,000 cps than at 
4,000 cps. 

Below 2,000 cps, the rise in the 
absolute threshold with decreasing fre- 
quency is ascribed to increased internal 
Noise rather than to reduced middle-ear 
transmission. Since the internal noise 
has already been included in Equation 

as part of the expression E:/Eo, no 
Other account need be taken of it here. 
"а to a is omitted in the rest of 

is paper for the sake of simplicity. 
M. The model of loudness described 
of ar for a single tone or narrow band 
mes may also be applied to com- 
еъ sounds such as wide bands of 
fhatiy, and multiple tones. Two alter- 
ve procedures are available. 


Spacitic loudness, N, In Sonas,/Bork 


Excitation level, Lg , in decibels 


Fic. 6. Specific Loudness № in sonec/ 
Bark as a function of the Excitation Level 
Lr. The exponent is Е = 4. (The param- 
eter on the curves is the Excitation Level 
L: of the test tone at threshold [Zwicker, 
1958; adapted with permission of Acustica].) 


The first procedure is exactly like 
that described above for obtaining the 
loudness value for a single tone. The 
masking pattern produced by the com- 
plex, as masking sound, is measured 
and transformed to the excitation pat- 
tern. The specific loudness can be read 
from Figure 6 as a function of tonal- 
ness to yield a loudness pattern whose 
integral over z is the total loudness. 
In the second procedure, the measure- 
ment of the masking pattern of the 
complex sound is replaced by the 
much simpler measurement of the SPL 
in each component critical band of the 
stimulus. Each band is treated as an 
independent stimulus that gives rise 
to an excitation pattern like those 
shown in Figure 4. The overlapping 
patterns thus obtained have a common 
upper envelope, which determines the 
excitation level; wherever two or more 
patterns overlap, only the highest exci- 
tation level is used. This procedure 
requires, of course, the prior measure- 
ment of the masking pattern for bands 
of noise at various SPI.s and centered 
at various frequencies. Fortunately 
the shape of the masking patterns 
changes slowly enough with center fre- 
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Fic. 7. Specific Loudness М, of Partially 
masked pure tones as a function of the exci- 
tation level of the tone. (The parameter on 
the curves is the Excitation Level Ly of the 
masking sound [Zwicker, 1963; adapted with 
permission of Acustica].) 


quency so that measurements of the 
masking patterns at only 6 or 7 scat- 
tered frequencies suffice to give good 
approximations of the patterns over 
the whole audible frequency spectrum. 

Examples of the application of both 
procedures are given below. 

6. The loudness of partially masked 
sounds must be treated somewhat dif- 
ferently from the loudness of unmasked 
sounds. A sound is partially masked 
when its loudness is reduced by the 
Presence of another sound. Partial 
masking occurs (and the measure- 
ment of the usual masked thresholds is 
possible) because the components of 
a complex sound may be distinguished 
from one another; their loudnesses do 
not always summate to yield an overall 
loudness. The question now posed is 
how our model can handle partial 
masking. 

A series of measurements (Zwicker, 
1963) showed that the present model 
underestimates the loudness of a 1,000- 
cps tone partially masked by either 
narrow-band or wide-band noise at 
various levels, This underestimation 
could be corrected, however, by an ad- 

justment of the curves in Figure 6. 
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Apparently when the excitation at 
threshold, L;, is produced by an exter- 
nal masking noise, the loudness of a 
partially masked tone increases more 
rapidly with intensity than when the 
threshold excitation is produced by in- 
ternal noise. Consequently, the curves 
of Figure 6 must be steeper if they are 
to be used with partially masked 
sounds. Adjustment of the curves in 
Figure 6 to yield loudness patterns for 
a partially masked tone whose integral 
better approximates the measured loud- 
ness leads to the curves of Figure 7. 
In place of the excitation level, L,, at 
threshold, the excitation level, Ly, of 
the masking stimulus is the parameter 
on the curves, 

At low values of Ly and L, the 
curves in Figure 7 resemble those in 
Figure 6. Since, after the correction 
for the transmission characteristics of 
the middle ear, L, is small for all but 
the low frequencies, perhaps the curves 
of Figure 7 ought to supercede those of 
Figure 6. Until it is known, however, 
which set of curves is more appropriate 
for the low frequencies, where internal 
masking is assumed to account for the 
large absolute threshold, it is expedient 
to retain the curves of Figure 6, which 
have already been in circulation for 
some time. Furthermore, the curves 
of Figure 7 are for pure tones masked 
by noise and probably do not accurately 
describe the growth of the specific 
loudness of tones masked by other 
tones or of narrow-band noise masked 


by tones or by other noises (Zwicker, 
1963). 


APPLICATION oF THE Moper to Loup- 
NESS SUMMATION AND ANALYSIS 

. Our model of loudness serves two 

important and closely rel 

It permits an underst 

changes in loudness un 
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and analysis. A convenient transfor- 
mation of the input stimulus into exci- 
tation and loudness patterns provides a 
means for inferring the interactions 
within the auditory system. In terms 
of these interactions, most of the facts 
of loudness summation may be ex- 
plained. At the same time the model 
permits the precise calculation of loud- 
ness so that manipulations of the model 
to predict (or postdict) the results of 
subjective measurements may be put to 
rigorous test, (The model is also pre- 
sented elsewhere [Zwicker, 1959] in 
a more schematized version for third- 
octave band filters that simplifies the 
calculation of loudness where a numeri- 
cal value is the only requirement as in 
audio engineering. ) 

In this section, the model is applied 
to three types of sound. In each case 
an important aspect of loudness sum- 
mation or analysis is explained in 
terms of the model, and calculated 
values for loudness ar 
measured values, 


band of white noise, The second sound 


Loudness of Bands of Noise 


The loudness of 
of constant Over; 
unchanged as i 


all centered at a geometric mean of 
1,480 cps and all with an overall SPL 
of 50 decibels. These excitation pat- 
terns are based upon masking patterns 
like those in Figure 4. For the noise 
one critical band wide, the excitation 
pattern corresponds to that produced 
by a pure tone or narrow band of 
noise. For a noise wider than a critical 
band, the patterns produced by contig- 
uous critical bands have been combined. 
By means of Equation 11 as graphed 
in Figure 6, the values Ly are con- 
verted over tonalness to the specific 
Se values are plotted 
as the loudness Patterns in Figure 8b 


and also in Figure 8с. The more 


ch ons of Figure 8c fa- 
cilitate integration and show more 
patterns change with 
The loudness levels computed 
from Figure gc were almost identical 
to those from Figure gb, Loudness 
values Were also computed for bands of 
noise at 15 and 100 decibels from the 


loudness patterns pr in Fi 
а 5 presented in Figures 
9 and 10, и 


car from these patterns 
why loudness is Constant within the 


Why its increase 


loudness of each of the 
Portional to the ar 
ness pattern 


5 noises is pro- 
ёа under the loud- 
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Fic. 9. Specific loudness patterns for 5 bands of noise, all at 15-decibel SPL. 
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Fig. 10. Specific loudness patterns for 5 bands of noise, all at 100-4 


pu н exceeded, however, the shape 
o e loudness pattern begins to 
ike i Its height decreases (owing to 
Pe ues in the intensity of each 
ка. Бапа), and it extends over а 
Neq portion of the z axis. At mod- 
ue d “age such as 50 decibels in Fig- 
ine, the vertical decrease of the 
pede more than compensated by the 
ein ntal increase, so that the total 
ese the total loudness increase. 
es a though the excitation and with 
е height of the central portions of 


the loudness pattern decrease as band- 
width increases, excitation at the lower 
and upper tonalness values increases 
sufficiently and enough new elements 
come into play to raise the overall loud- 
ness. Here we see why loudness in- 
creases with bandwidth at moderate 
Why doesn't it increase as 
much at low and at high levels? 
Figure 9 provides an answer for soft 
noises. Two changes from the 50- 
decibel noise are evident. First the 
height of the loudness patterns de- 


levels. 
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Fic. ll. The dependence of the loudness 


of a band of noise upon bandwidth, (The 
parameter on the curves is the overall SPL 
of the bands of noise. Open circles are 
values measured in an experiment by 
Zwicker, Flottorp, and Stevens, 1957, Filled 
circles аге the values calculated from 
Equation 11, An excellent fit between meas- 
ured and calculated values obtains.) 


seen in Figure 6, 
the loudness of a 


level is that the 
patterns are narrower 


terns, loudness 


bandwidth in. 
creases beyond the Critical band, 


he situation is 


reversed as a comparison of Figure 10 
with Figure 8с reveals. Here, the 
height of the loudness pattern changes 
with increasing bandwidth in approxi- 
mately the same proportion as at the 
moderate level, but the loudness pat- 
terns are spread over considerably 
larger portions of the tonalness scale 
at the higher end. The Spread is in- 
itially so extended that increasing the 
bandwidth beyond one critical band 
increases at first only the area at the 
lower tonalness values. At the same 
time, the specific loudness is reduced 
at the middle tonalness values, offset- 
ting any gain from the greater spread. 
Consequently the loudness does not be- 
gin to increase until the stimulating 
bandwidth extends over 3 to 5 critical 
bands. 

Integration of the areas under the 
loudness patterns of Figures 8c, 9, and 
10 yields values for the loudness level 


of the bands of noise. These values 
are plotted as the filled с 
ure 11. 


on the abscissa 
Flottorp, & Stevens, 1957 7 
all SPL of the bands of n 
constant at 30, 50, 80. 
The calculated and m 
close to one another, 
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nearby radio. Yet the presence of the 
one sound may distinctly affect the 
loudness of the other. Such a situation 
arises when a tone and a band of noise 
are presented together, and the model 
may be used to understand some of 
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Fic. 12. Excitation patterns for a narrow-band masking no 


саблей spectra for the noise and tones are also shown. 
is тен produced by the tone is suppressed by 
€duced, interference with the higher frequency to! 


© lower frequency tone. Thus although the higher frequency 


the interactions that take place within 
the auditory system and also to calcu- 
late the loudness of the tone. 
Measurements are available of the 
loudness of pure tones partially masked 
by a narrow band of noise (Scharf, 
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higher frequency tone grows in loudness more ra 
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1964). The tones were located at fre- 
quencies above and below the frequency 
range of the noise, which was one criti- 
cal band wide centered at 980 cps. 
Whereas complete masking was shown, 
as usual, to be more effective toward 
the higher frequencies, partial masking 
was found to be generally more effec- 
tive toward the lower frequencies. 
This change appears paradoxical until 
the model is used to analyze the excita- 
tion patterns produced by the tone and 
band of noise. 

Figure 12 shows the standard exci- 
tation patterns for two of the tones and 
for the band of noise used in the ex- 
periments. At the top of the figure 
are the idealized spectra of the three 
sounds. Analysis of these patterns is 
simplified if we assume that at those 
points where the excitation patterns of 
the tone and noise overlap, whichever 
pattern has the higher excitation level 
completely suppresses the other. (A 
difference of 3 decibels would actually 
be required for complete suppres- 
sion.) In the figure an intense noise 
completely masks the higher fre- 
quency tone, but only partially masks 
the lower frequency tone, When 
the noise intensity is reduced, more 


Excitation level in decibels, 
ое 5 sg 8 уде 8 


© 


Tonalness in Barks 
Fic. 13, Excitation patterns for an 830- 


narrow-band noise at various SPI s, 


of the excitation pattern of the 
lower frequency tone remains sup- 
pressed than of the higher frequency 
tone. The size of the suppressed area 
indicates the degree of loudness re- 
duction, i.e., partial masking, so that 
it is clear that the moderate and soft 
noises partially mask the lower fre- 
quency more than the higher, This 
difference arises from the asymmetry 
of the excitation patterns toward the 
higher frequencies as examination of 
the patterns reveals. 

The model can also be used to cal- 
culate the loudness of the partially 
masked tones. Figure 13 shows the 
excitation patterns for the masking 
noise at various levels and for one of 
the tones. These patterns differ some- 
what from the standard excitation pat- 
terns; they were taken from the masked 
audiograms produced by the bands of 
noise used in the experiment, The same 
subjects made both the threshold and 
the loudness judgments. The loudness 
patterns for the 830-cps tone set at 
65-decibel SPL and Presented against 
five different levels of the narrow-band 
in Figure 14, These 
5 Were constructed by 
5 for the specific loud- 


noise are shown 
loudness pattern 
using the curve 


cps tone (solid curve) and a 
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Fi 
16. 14. Loudness patterns for an 830-cps tone (at 65-decibel 
The loudnes (The noise was centere 
hen the s patterns for the tone do not change until t 
noise SPL reaches 96 decibels, the tone is compl 


у à narrow-band noise. 


plan partial masking (Fig. 7) 
дь E the pure tone excitation 
ies а igure 13 to the specific loud- 
level Í us qaae The excitation 
the che к the noise pattern provided 
Chien es for Ly required in Figure 7. 
ere) à patterns (not reproduced 
a саре also constructed for the 830- 
at 980 à d 45 decibels and also for tones 
ais, va 1,355 cps at 45 and 65 deci- 
Ress рай, ues calculated from the loud- 
5 dimer are plotted for the tones at 
Mei 5 in Figure 15 along with the 
EO and loudness levels (Scharf, 
(The ШЫ ` the interquartile ranges. 
is similar t or the tones at 45 decibels 
and is th o that for tones at 65 decibels 
= ана not reproduced here.) 
calculated ment of the observed with the 
CDs ton values is best for the 980- 
centered where the tone and noise were 
[ече ы the same frequency so that 
Would te v in the excitation patterns 
Ment is nd to cancel out. The agree- 
у es for the 830-cps tone. The 
CPs tone is not as good for the 1,355- 
the data; although the general trend of 
a is reproduced. Since this tone 


was at a higher frequency than the 
noise, the excitation levels at the higher 
tonalness values where the noise’s ex- 
citation pattern had little effect on the 
tone’s pattern were especially impor- 
tant in the loudness calculations. But 
it is precisely at the higher tonalness 
values on any given excitation pattern 
that the levels are most uncertain, for 
the masking patterns upon which they 
are based are most variable there both 
within and among subjects. 

Given the difficulty of the loudness 
judgments and the many steps in the 
loudness calculation in this complex sit- 
uation, the model fares well not only 
in a qualitative analysis (Fig. 12), but 


also in the quantitative calculations. 


Perhaps, better calculations could be 
finements of 


obtained with further re 


the model. Nevertheless, using the 
model as it stands, let us turn to a still 
rrangement. 


more complex stimulus а 


Loudness of a Partially Masked Com- 


plex Sound 
In the same way that the loudness 
of a pure tone can be distinguished 
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Fic. 15. De Pure tone at 65 decibels on the SPL of a 
partially masking narrow-band noise, (The ordi € equally loud 


tone, but was presented in the 
T On the curves, Each Open circle is the 


ve subjects. Interquartile Tanges are also 
noise SPL required t 


excitation patterns like those in Figure 14.) 


from a neighboring band of noise, a 


| threshold) of the Complex, Even a 
multitone complex can be distingui 


А | у intense complex, which in 
rom | тане ү The loud- the quiet shows the greatest amount 
ness o а; -tone comp ex has been meas- ОЁ loudness Summation, decreased in 
ured against various levels 


of а uni- loudness а 
form masking noise (Scharf, 1961а). 


5 bandwidth increased when- 
Results showed loudness г 


ever an int i 


ola ense masking noise was 

emains in- simultaneously Presented. How does 
variant within the critical band. Be- our model account for this finding? 
yond the critical band, the changes in 1 


loudness depend Primarily on the sen- plex are assumed to rema: 
sation level (number of decibels above j 
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16 shows the excitation pattern for 
some of the complexes used in the ex- 
periment and for a uniform masking 
noise at an overall level of 55 decibels, 
which corresponds to an SPL of about 
40 decibels in each critical band. 
These patterns are combined from the 
standard excitation patterns measured 
on narrow bands of noise. The indi- 
vidual excitation patterns evoked by 
the four tones that comprise the com- 
plex stand out rather clearly when 
the tones are spread out as in the 
2,200-cps wide complex. 


SPL of complex © 


GOdb AF in cps 


Excitation level in decibels 


Tonainess in Barks 


ms 16. Excitation patterns for a 4-tone 

and ex at an overall SPL of 60 decibels 

ы ей at 2,000 cps. (Patterns for 

е ауа values of AF are shown, The 

unifor Ine 1s the excitation pattern for the 

jac m masking noise when the SPL in 
1 critical band was 40 decibels.) 


Hove the excitation patterns are 
a by the noise, the loudness 
Shown h may be greatly affected as 
Each f y Figures 17, 18, and 19. 
ims Pl presents the loudness pat- 
Tevel "3 the complexes in a different 
the а: noise. In Figure 17, where 
the LOS was inaudible at 15 decibels, 
he loudness patterns grow larger as 
complex becomes wider. Noise at 
Tens 45 decibels did not produce a 
terr able change in the loudness pat- 
with A this 60-decibel complex, but 
tibet overall noise level raised to 55 
much S: the patterns do not increase as 
П with AF; they increase still less 
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Fic. 17. Loudness patterns for four dif- 
ferent 4-tone complexes at 60-decibel SPL. 


(No background noise.) 


when the noise level was 65 decibels 
(Fig. 18). With the noise at 70 
decibels and the complex almost com- 
pletely masked (Fig. 19), the size of 
the loudness patterns, and therefore 
the loudness, actually decreases as AF 


increases. 
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masking noise 
=65db 
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Loudness patterns for the four 
ainst a uniform mask- 


Fic. 18. 


complexes presented ag Т К 
ing noise with an SPL of 50 decibels in 


each critical band (overall SPL approxi- 
mately 65 decibels). (Both the height and 
spread of the patterns are somewhat reduced 
by the addition of the noise background.) 
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Fic. 19. Loudness patterns for the four 
complexes presented against a uniform 
masking noise with an SPL of 55 decibels 
per critical band (overall SPL approxi- 
mately 70 decibels). 


Loudness diminishes with increased 
AF in the presence of intense noise 
because near the masked threshold, 
even more so than near the absolute 
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threshold, the specific loudness changes 
very rapidly with the excitation level. 
Consequently, as the excitation spreads 
beyond a single Bark, even a small 
decrease in the level of excitation within 
each Bark produces a large drop in the 
loudness pattern, A glance at Figure 
7, which is used to transform the exci- 
tation levels Ly to the specific loud- 
ness values N’, shows that near 
the masked threshold (approximately 
where the curves cross the abscissa) a 
reduction of only a few decibels in the 
excitation level over a single Bark 
causes a large reduction in specific 
loudness, Loudness summation ap- 
Pears then to be poorer in the presence 
of a noise because the Noise steepens 
the loudness functions for the compo- 
nent critical bands of the complex, and 


SPL OF STANDARD = 60db 


Overall 
SPL of 
Noise 


I5 db 


500 1000 
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Second — AF 


n (The SPL of the 
oise background was varie! 
ess matches by six subjects; 
values calculated from the 


1 2 loudn 
circles are the 
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the more intense the noise relative to 
the tone, the steeper these functions 
become. 

Now the question remains of how 
well the calculated loudness values 
agree with the experimentally meas- 
ured values. A direct comparison can 
not be made because the loudness levels 
of the complexes were not measured in 
the original experiment. Results were 
Siven in terms of the sensation level 
of an equally loud pure tone heard 
against the same noise background as 
the complex. Conversion of the cal- 
culated measures to the sensation level 
of the comparison tone in noise can 
te made with the help of data recently 
Published on the loudness relations be- 
tween a pure tone in noise and the 
a tone in the quiet (Zwicker, 
963). Figure 20 shows the sensation 
level of the comparison tone that 
Sounded as loud as the complex whose 
aa spacing is given on the abscissa. 

lie Open circles are the medians of 
12 Judgments by six subjects; the solid 
oe are fitted to the calculated loud- 

Ss values. In terms of the absolute 
ше; the agreement is not too good; 
litt calculated values are generally a 
lee high. However, both the 
the : ated and measured values. show 
пш Progressive decrease їп the 
mas of loudness summation as the 
until ki noise becomes more intense, 
both ke iia to the masked threshold, 
creasin s of values show loudness de- 

§ as AF increases. 
Summary of Calculations 
Meas respect to the three types of 
tween Н ишеп, the agreement be- 
Oudness - Observed and the calculated 
temas o must be considered 
Sounq "sed good. For each type of 
tions ® de Shape of the calculated func- 
Produces quite closely that of 


the measured functions. Although the 
absolute values may occasionally differ 
by many decibels, they are usually in 
good agreement, especially if we con- 
sider that the loudness judgments 
themselves were highly variable with 
the interquartile ranges for the various 
experiments averaging over 4 decibels. 
It seems unlikely, therefore, that any 
major modification of the model could 
eliminate the present discrepancies. 
As it now stands, the model provides a 
valid representation of the kinds of 
physiological processes which can ac- 
count for most of the psychophysical 
data on loudness and loudness summa- 
tion and which also agree with current 
theories of hearing (cf. Békésy, 1963). 


Some FINAL CONSIDERATIONS 


Our model does not try to deal with 
attentional, attitudinal, or other cen- 
tral factors that certainly affect the ex- 
perience of loudness. It is intended to 
apply to normal, unidirectional hearing 
under conditions of applied attention. 
Whether, under conditions of attention 
(such as sleep) that greatly alter the 
subjective experience of loudness, the 
model remains a valid representation 
of neural events occurring at some 
early stage in the nervous system, we 
do not know. The whole problem of 
the role of efferent innervation in the 
ditory mechanism is under intensive 
study (Davis, 1962); perhaps some 
clarification will soon be available. 
However, even if conditions in the 
cochlea turn out to be relatively con- 
we still do not know whether 
mmations and interactions pic- 
del are complete at the 
cochlear level. Indeed our ignorance 
may be even greater. Ades (1959) 
writes that, “for the moment, we can 
only exercise caution 1m theorizing 
about the (neural) mediation of loud- 


au 


stant, 
the su 
tured in our mo 
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ness - [p. 609].” Nevertheless 
“subjective loudness” is “usually д> 
tributed to the number of nerve im- 
pulses per second traversing the audi- 
tory nerve [Davis, 1959, p. 583]. 
While our model does not depend on 
this particular conceptualization (or 
any physiological theories or data for 
that matter), the specific loudness is 
perhaps most easily interpreted on a 
physiological level as dependent upon 
the number of neural impulses. Such 
a view served in the formulation of the 
model (Zwicker, 1958). 

Another underlying assumption that 
facilitates interpretation of the model 
is that loudness summation is primarily 
peripheral. This assumption receives 
compelling support from Niese's 
(1960) demonstration that loudness 
Summation takes place monaurally. 
Niese showed that the loudness of two 
narrow bands of noise increases as 
their frequency Separation increases 
beyond the critical bandwidth only if 
both bands are presented to the same 
ear. If the two noises are presented 
Separately to each ear, no increase in 
loudness with increasing frequency Sep- 
aration can be detected. Perhaps more 
direct evidence for the peripheral 
nature of loudness summation can be 
obtained from Studies of humans and 
animals with lesions at various levels 
within the auditory nervous System, 
Additionally, studies of interaural ef- 
fects in multidirectiona] hearing may 
provide interesting data, 

Physiologically the Critical band is 
more of an enigma than loudness, for 
we do not know where it is formed 
orhow. That it is formed upon stimu- 


mental instructions 
Such an inference i 


lation and is not already there as some 
fixed structural unit, something like a 
variable band-pass filter (cf. De Boer, 
1962), is clear. The critical band is a 
continuous function of frequency, not a 
step function—it may be measured at 
all the audible frequencies. Thus the 
critical band seems to form around the 
stimulating frequency as stimulation 
occurs. The formation of the critical 
band during stimulation suggests the 
necessity for a period of time during 
which the locus of stimulation is made 
known to the auditory system. Just 
such a temporal lag has been demon- 
strated by Scholl (1962) in measure- 
ments of the masked threshold for 
short pulses. The usual critical band 
does not appear until the duration of 
the pulse exceeds 10 milliseconds. Ве- 
low 10 milliseconds the critical band 
appears to become wider and below 5 
milliseconds matches the width of al- 
most the whole auditory spectrum. 
Perhaps the most likely process by 
which the critical band is formed is of 
an inhibitory nature, similar to that 
postulated by Békésy ( 1960) for the 
neural unit on the Skin and in the 
€ye. Tf the critical band is a measure 
of inhibitory processes, we may wonder 


why the masking pattern measured for 
a pure tone show. 
continuit 


are pri- 
Perhaps our 

Tve to guide research in 
this area as well as in the psychoacous- 
tical area. Whatever its advantages as 
а generator of new research, the model 
Provides a comprehensive summary of 


— 


any case no one has reported that the average 
critical band in loudness summation experi- 
ments varies in any Consistent fashion. 
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available psychoacoustical data on loud- 
ness and permits manipulations and 
calculations that lead to fruitful new 
Insights into the nature of the auditory 
processes, 
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CONTEMPORARY INSTINCT THEORY AND THE 
FIXED ACTION PATTERN' 


HOWARD MOLTZ? 
Brooklyn College 


The observable properties of the Fixed Action Pattern (FAP) as 
well as the processes assumed by the contemporary instinct theorist 
to underlie the structure and organization of this allegedly unique 
response class are described. An attempt is then made to assess the 
significance of the FAP for psychology. Finally, a critical evalua- 
tion of the FAP, in the light of a review of relevant empirical evi- 
dence, is presented. On the basis of this evaluation it is concluded 
that the empirical properties of the FAP do not require the assump- 
tions of genetic encoding and central itemization—assumptions the 
instinct theorist considers indispensable—but instead can be inter- 
preted in a manner broadly consonant with an epigenetic approach. 


go pie the many vicissitudes which 
d concept of instinct has undergone 
uring the course of its long and 
multifaceted history, its essential ap- 
Broach to the analysis of behavior has 
emained virtually unchanged: the 
Postulation of some form of invariant 
Senotypically fixed response nucleus 
around which the more flexible and 
ра components of a behavior гер- 
ie ig allegedly develop. Holding 
that the seemingly attractive promise 
fir such a nucleus would constitute 
E understanding the structure and 
it г пуна of а species’ repertoire, 
insti not surprising that the concept of 
Stinct has managed to remain viable 


1 
sop present paper was written while the 
M3855 pe in receipt of Research Grant 
ealt rom the National Institutes of 
salth, United States Public Health Service. 
Seren familiar with the writings of 
Peryasi а will recognize the profound an 
thinking. influence they have had on my 
velopme about problems of behavioral de- 
арсен in general and about the genesis 
КЕЕ responses in particular. It is 
© ШЫ to acknowledge what is indeed а 
ton are tH tedness. Equally worthy of men- 
he critical and penetrating insights 


de 


о ег, 
Which У Evelyn Raskin—insights from 
ably, "© Present paper profited immeasur- 
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despite repeated attempts to exorcise 
it from the body of science. The pur- 
pose of the present paper is to discuss 
and critically evaluate its most recent 
bid for redemption—a bid embodied 
in the form of a new theory of instinct. 

This theory, formulated by several 
European ethologists of whom the 
most notable are Lorenz and Tin- 
bergen, has as its core the concept 
of the Instinctive Movement or the 
Fixed Action Pattern (FAP). Based 
primarily, although by no means ex- 
clusively, on naturalistic observations 
of free-ranging species, the concept of 
the FAP has come to occupy a posi- 
tion of such central importance in con- 
temporary instinct theory that one can 
hardly discuss the current state of that 
theory without describing the observ- 
able characteristics of FAPs as well 
as the neurophysiological events which 
have been assumed to underlie their 
structure and organization. Accord- 
ingly, our first task shall be to de- 
lineate briefly those empirical and 
theoretical properties which have been 
assumed to render the FAP a distinct 
class of response events. (Detailed 
accounts and illustrations of a wide 
variety of FAPs are to be found in 
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Eibl-Eibesfeldt and Kramer, 1958, in 
Hinde, 1959, and in Moltz, 1962.) 
We shall then attempt, in the light of 
a review of the relevant empirical evi- 
dence, to assess the significance of the 
FAP as well as to evaluate the theory 
into which it has been incorporated. 
This evaluation will draw upon such 
valuable appraisals of instinct theory 
as those of Hebb (1953), Hinde 
(1954, 1959), Lehrman (1953, 1956), 
and Schneirla (1956, 1957). 

Before beginning, however, two 
points should be made explicit. First, 
because the contemporary instinct 
theorist often refers to himself as an 
“ethologist” and because he often 
speaks of ethology as having derived 
from the study of the FAP, there has 
been a tendency on the part of Ameri- 
can and European Psychologists alike 
to regard all ethologists as instinct 
theorists, Nothing indeed could be 
further from the truth. Many investi- 
gators call themselves “ethologists” 
simply to indicate that they are zoolo- 
gists engaged in the comparative study 
of animal they would 


only to in- 
For a detailed review 


‘orienting attitudes” of 
ethology and for 4 discussi 


On of its 
history in mediating Tesearch, see 
Hinde (1959), 

Secondly, 


the impression Should not 
be gained that all those who have con- 


tributed to the nexus of ideas and con. 
cepts referred to herein a 
тагу instinct theory” have remained 
fixed in their beliefs. There have been 
apostasies to be sure, most notably 
that of Tinbergen himself (1955; see 
also Hinde & Tinbergen, 1958) who 
has repudiated some of 
theoretical formulations, 

tion is made of this in w 


5 “contempo- 


his earlier 
That no men- 
hat follows is 


due simply to the fact that our task 
is not to examine the views of any 
individual theorist nor to discuss the 
way in which his views might have 
changed, but rather to evaluate a par- 
ticular approach to the understanding 
of species-typic behavior—an approach 
which continues to have a profound 
influence on the current thinking of 
those concerned with developmental 
problems. 


EMPIRICAL PROPERTIES OF тне FAP 


Although there is widespread agree- 
ment that FAPs comprise a unique 
class of responses, there is little agree- 
ment among ethologists in general or 
even among instinct theorists them- 
selves concerning the criteria requisite 
for class inclusion, However, each of 
the four properties which we shall dis- 
cuss below in attempting to delineate 
the empirical character of the FAP 
has been considered a distinguishing 
and primary feature and each has in 


turn posed rather important theoreti- 
cal issues, 


Stereotypy of the FAP 


A reasonably precise distinction is 
often made by ethologists and instinct 
theorists alike between appetitive be- 
havior and consummatory acts, “Ap- 
petitive behavior” is the term used to 
designate the labil 


e initial components 
of a behavior se 


quence, while “соп- 
Summatory act” ; 


а 15 the term used to 
designate the rigid and terminal com- 
Ponents (Thorpe, 


1954, 1956). 
ose movement 


S or movement pat- 
terns designated as FAPs are classi- 


1 Yped and constant in 
orm. Their most salient character- 


to be their stereotypy ; in 
deed, in this respect, they have been 
Compared with Morphological traits; 
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for, like such traits, FAPs are ex- 
pressed in a highly invariant manner 
within a taxonomic group (Lorenz, 
1956). 

As an example of this invariance we 
may consider an aspect of the repro- 
ductive behavior of the three-spined 
Stickleback. Aeration of the stickle- 
back nest, which is apparently neces- 
Sary to maintain the oxygen concen- 
tration of the surrounding water at a 
level appropriate for egg development, 
is accomplished through a movement 
called “fanning.” This consists of 
using the pectoral fins to exert a for- 
ward pressure on the water and simul- 
taneously using the tail and caudal fin 
to exert a backward pressure equal in 
intensity to the forward pressure. The 
Sequence of movements performed by 
these structures is the same on each 
occasion that fanning is exhibited 
(Baerends, 1957). Indeed, even un- 
der circumstances which apparently 
force the fish to adopt an atypical po- 
Sition in relation to the substrate, the 
Patterning or coordination of the com- 
Ponent response elements remains in- 
variant, 


Independence from Immediate Exter- 
nal Control 


" If a movement pattern is to be clas- 
Sified as an FAP, the temporal se- 
uence of muscular contractions com- 
Prising the pattern must be independ- 
ent of afferent regulation. Such inde- 
Pendence is evidenced by the fact that 
the movement will often continue to 
ae irrespective of changes in 
Sine conditions. This property 
н, to distinguish the FAP from 
elicit "ps termed "taxes" which once 
ко, continue to be directed by 
Cease { external to the animal _and 
us is © occur once the external stimu- 
Patt removed. The egg-retrieving 

‘rn of the graylag goose, described 


in detail by Lorenz and Tinbergen 
(1938), illustrates the difference be- 
tween the taxis and the FAP. 

A brooding graylag goose, observ- 
ing that an egg has rolled out of its 
nest, reacts in a characteristic man- 
ner. It slowly rises from the nest and 
approaches the displaced egg. Upon 
reaching it, the neck is extended 
downward and forward so that the 
undersurface of the bill comes to rest 
against the far side of the egg. Two 
distinct movements are then employed 
which serve to roll the displaced egg 
in the direction of the nest—a sagittal 
movement that keeps the egg rolling 
in the bird’s median plane, and a 
lateral or side-to-side movement that 
keeps it from deviating too far either 
to the right or left. 

The sagittal movement has been 
classified as an FAP since its form 
remains constant despite the irregu- 
larities of the terrain over which the 
egg is rolled and despite differences 
in the shape of those objects that have 
been experimentally substituted for the 
egg. Furthermore, if the egg rolls 
away from the bill, as occasionally hap- 
pens in the natural situation, the goose 
will often continue to perform this 
movement in the same manner as 
when the egg was present, indicating 
that the movement is apparently no 
longer under extrinsic control. 

The lateral movement is classified 
as a taxis, since it is both evoked and 
continuously directed by contact of the 
egg with the undersurface of the bill. 
Thus, if the egg deviates slightly from 
the bird’s median plane, a compensa- 
tory movement either to the right or 
to the left immediately restores it to 
its path. If an object that is unlikely 
to deviate (eg. а cylinder or light 
wooden cube) is substituted for the 
displaced egg; few or no lateral move- 
ments are performed ; finally, if the 
egg happens to roll completely free of 
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the bill, the lateral movement, unlike 
the sagittal, ceases. | 
Although the example just given 
illustrates quite clearly the distinction 
between the taxis and the FAP, it is 
difficult in many cases to determine 
whether a particular movement is di- 
tected by external events or is simply 
released by such events (Baerends & 
Baerends-van Roon, 1950). Further- 
more, the taxis and the FAP fre- 
quently occur either simultaneously or 
in close succession, such interlocking 
making differentiation additionally 
difficult (Lorenz, 1937 ). 
At this point it is appropriate to 
emphasize what has thus far been im- 
plicit in our discussion, namely, that 
the FAP does not possess the cha. 
teristics of a chain reaction, Although 
it may involve a relatively complex 


S, the 


Tac- 


ulus factors responsible 
Each component 
T words, can be 
same stimulus or 


stimulus complex as that which elicits 


the entire FAP. 
S; bontaneity 


A third Property of the FAP is its 
Spontaneity, a term used here to de- 
note fluctuations in threshold that are 
independent of changes in external 


conditions, The general rule observed 


demonstrated 
that when a male Stickleback is pre- 


nest. This is not due to changes in 
such extrinsic conditions as the ac- 
cumulation of carbon dioxide and 
other gases released by the eggs in the 
fish’s absence, since the same result 
was obtained when the experiment 
was repeated with the nest kept com- 
pletely covered. 

Fatigue has also been ruled out as 
а factor contributing to the fluctua- 
tions in the occurrence and intensity 
of FAPs. If one keeps a cichlid fish 
in close confinement with conspecifics 
So that stimuli evoking fighting are 
continuously present, the thresholds of 
Such agonistic movements as lateral 
tail beating, Spreading the gill mem- 
branes, and erecting the median fins 
are raised (Lorenz, 1956). Neither 
general fatigue nor fatigue of specific 
effectors can account for this finding, 
inasmuch as the fish will at the same 
time perform other activities involving 
the same effectors and apparently re- 
quiring just as much effort. 

The most dramatic bit of evidence 
regarding the Spontaneity of the FAP 
nder certain circum- 
im vacuo. That is, 


sponsible for its 
will sometimes be Performed in the ab- 


Sence of that stimul 1 
or example, уал 


nning is 


bsence of a nest. Simi- 
Orenz has called attention to 
the fact that the complex motor pat- 

the weaver bird's 
for its nest is some- 


in the absence of 
plant fibers that normally serve as the 


uum activities are per- 
Uenty and performed 
MS atypical for the spe 


formed infreq 
under conditio 
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eles; the tendency for less extreme 
variations in elicitability to occur as a 
function of the time clapsing since 
previous performance is as already in- 
dicated considered to be characteristic 
of all FAPs. Moreover, this charac- 
teristic is often emphasized in the at- 
tempt to distinguish it from reflexes 
or reflex-like movements (Eibl-Eibes- 
feldt & Kramer, 1958; Lorenz & Tin- 
bergen, 1938; Thorpe, 1954). 


Independence from Individual Learn- 
ng 


‚ A fourth criterion for membership 
in the class of FAPs is the exclusion 
of the possibility that the movement or 
acini gal pattern has been specifically 
еман prior to its first occurrence. 
St Rage (1956), for example, main- 
ains that even wide fluctuations in 
va vironmental conditions during on- 

Ogeny will in no way alter the FAP 
provided the health of the organism is 
not impaired. 

" The isolation technique is frequently 
а garded as the critical method for as- 
pow the influence of learning (Eibl- 

-ibesfeldt, 1961). In its simplest 
saat this technique consists of re- 
Es hee an animal at the time of birth 
Gece from members of its own 
ü ion and then determining whether 
ieu response is subsequently per- 
5 eus In a manner identical to that 
арна by animals reared with con- 
and oa Thus, Tinbergen (1942) 
stick үер (1960) report that male 
itn: | acks reared in isolation per- 

iore a typical zigzag courting dance 
stickleb, they had ever seen another 
on te ack; in fact it was performed 
introdu first occasion that they were 
grayi uced to a cardboard model of a 

avid female. 

i appearance of many movements 
орен ated as FAPs very early in on- 
the Y is also considered evidence that 

У could not have been learned in 


any ordinary sense of that term. Thus, 
the side-to-side head movement of the 
human infant is present at birth 
(Prechtl, 1958), and the gaping re- 
sponse of the thrush is present at hatch- 
ing (Tinbergen & Kuenen, 1939). 

Additional evidence that the FAP 
is dependent neither on individual 
learning or practice, comes from ob- 
servations that the FAP occurs even 
when certain structures involved in 
the movement are absent. For exam- 
ple, in one species of surface-feeding 
ducks, courtship involves a preening 
movement by which the drake exposes 
and wriggles a brilliantly colored terti- 
ary feather. Lorenz (1955) reports 
the case of a drake that had, for un- 
known reasons, failed to develop this 
feather but which nonetheless per- 
sisted in performing the preening 
movement. Similarly, FAPs can ap- 
pear in ontogeny long before they pos- 
sess functional utility and hence long 
before a (presumably) necessary con- 
dition for learning is present. For ex- 
ample, a young gosling in an aggres- 
sive encounter holds its adversary in 
exactly the same position in space and 
uses exactly the same beating move- 
ment as the adult goose despite the 
fact that its wings are not large 
enough to touch its opponent (Lorenz, 
1956). 

This last example reveals the nar- 
row conception of learning held by 
most instinct theorists. They reason 
that, if no source of reinforcement can 
be identified in connection with a par- 
ticular response, then the response 1S 
d (Eibl-Eibesfeldt 


perforce not learne 
& Kramer, 1958; Lorenz, 1955). But, 
despite their tendency to equate the 
absence of reinforcement with the ab- 
sence of learning, they have admittedly 
pointed to many aspects of animal be- 
havior which appear to be nonlearned. 
They have not, however, thereby dem- 
onstrated the reality of behavior ele- 
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ments that are uninfluenced during 
ontogeny by extrinsic stimulus condi- 
tions, or by what may be more sim- 
ply labeled experience. Subsequently, 
we shall have occasion to argue that 
learning and experience are not equiva- 
lent agents in the determination of be- 
havior. 


THEORETICAL PROPERTIES 
OF THE FAP 


In our delineation of the empirical 
characteristics of the FAP, it thus far 
has been treated solely as a descrip- 
tive concept. Now, however, we must 
turn to an analysis of some of the 
mechanisms or processes which have 
been inferred by instinct theorists to 
underlie the development and organi- 
zation of the FAP and which in turn 
explain its empirical Properties, 


Genetic Encoding 


stinct theorists believe that FAPs de- 
У of a specific em- 


ment, it does imply 
5 of the genic consti- 


nd others (е.р., 
been quite explicit 


ance. Bullock (1961) has Stated the 
instinct theorists’ Position on this point 
quite precisely : 

It seems at present h 
many complex behavior. 
System contains not o 


ighly likely that for 
al actions the nervous 
nly genetically deter- 


mined circuits but also genetically deter- 
mined physiological properties of their com- 
ponents so that the complete act is repre- 
sented in coded form and awaits only an 
adequate trigger, either internal or external 


[p. 55]. 


The recent work of Miller (1957), 
Andersson (1953), and Harris, 
Michael, and Scott (1958) has, of 
Course, demonstrated the close asso- 
ciation between neural loci and spe 
cific responses. But instinct theorists 
go much further, for they speak of the 
FAP as being not only centrally inte- 
grated but centrally blueprinted—a 
blueprint that is directly provided for 
in the growth process itself, And they 
contend that it is by virtue of the in- 
trinsically established blueprint that 
the FAP—as а temporally patterned 
system of response elements—can ар" 
pear without having been ontogenetl- 
cally organized. Indeed, since such 
Systems are considered to be geneti- 
cally specified down to the smallest 
detail, it is understandable that the in- 
fluence of experiential events in deter- 
mining their configuration and pheno- 
typic expression is held to be not only 
unnecessary but ineffective (Lorenz 
1937, 1956). nd it is equally utl 


derstandable, granting once more the 
specific encodj 


retains its distin" 
tive character in the behavior of thé 
adult organism, 


Action Specific Energy 

As alread 
theorists h particularly 10 
pressed with the Spontaneity of wi 
FAP, often regarding it as that pro?” 
heb, Which renders the FAP a uniqt? 
behavioral event. To Lorenz, the 


“Pontaneity suggested that each FA 
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Possesses its own source of energy 
which accumulates in that locus of the 
central nervous system responsible for 
its coordination. This accumulation is 
assumed to occur while the particular 
movement is quiescent and to be ex- 
pended when the movement is dis- 
charged. Tinbergen (1951) has also 
Proposed an essentially similar reser- 
voir-model of motivation involving a 
hierarchically organized system of 
neural centers responsible for the ac- 
tivation of a chain of functionally re- 
lated activities (e.g., those involved in 
reproduction). 

Of particular theoretical interest 
with respect to the reservoir-type 
model of motivation is the assumption 
by the instinct theorist that motor im- 
Pulses are endogenously produced in 
а specific center and that the basic 
Pattern of production not only occurs 
independently of afferent inflow but is 
in fact refractory to modulation by 
Such inflow, In maintaining that the 
Spontaneity of the FAP is a manifesta- 
tion of such a purely central auto- 
maticity, the instinct theorist points to 
Cardiac and respiratory activities, as 
Well as other basic functions, and con- 
Siders evidence regarding their spon- 
‘neity as providing clues for under- 
standing the neurological basis of com- 
plex behavioral spontaneity. 

In this connection, the studies of 
Adrian (1931), von Holst (1934), 
5 aymara. (1955) and others are fre- 
luently cited, Adrian and Buytendijk 
1941), for example, concluded that 

Piratory activity is endogenously 
ernerated after finding that potential 
b anges recorded from the isolated 
Tain stem of the goldfish exhibit 
eat same frequency as that of nor- 
» breathing movements. Maynard 
wee found that when the factors 
е h influence heart beat in the lob- 

* are held constant, a complex peri- 


odic pattern of impulses is still gen- 
erated by the cardiac ganglion. 

It obviously requires a long induc- 
tive leap to go from the comparatively 
simple functioning of an isolated seg- 
ment of the nervous system to the 
temporally organized behavior of the 
intact animal. But it is precisely this 
leap that the instinct theorist has 
taken. 


SIGNIFICANCE OF THE FAP 


Before attempting to analyze the 
psychological implications of the FAP, 
two sources of misunderstanding, each 
of which has led to some confusion 
concerning the significance of the FAP, 
must be clarified. In the first place, it 
should be apparent that the FAP is 
not to be identified with response pat- 
terns which have been traditionally re- 
garded as instinctive and which have 
usually been given such global designa- 
tions as maternal behavior, filial be- 
havior, migratory behavior, еіс. What 
must be emphasized is that each of 
these labels refers to a myriad of func- 
tionally related activities of which only 
a few might be both empirically and 
conceptually identical to the FAP. 
To the author’s knowledge, no con- 
temporary instinct theorist ever con- 
sidered maternal behavior, for exam- 
ple, to be innate in the same sense that 
the FAP is considered to be innate. 


Certainly, it is irrelevant to criticize 
either the reality o 


r the significance 
of the FAP by showing that p 


atterns 
like maternal behavior, or any pattern 
of such heterogeneous composition, 15 
experientially organized and conse- 
quently could be neither genetically 
encoded nor endogenously generated. 
Secondly, assessment of the FAP 
c twofold, since its value as an 
concept is independent of the 
validity of those hypothetical mecha- 
nisms assumed by instinct theorists to 
underlie its structure and organiza- 


must b 
empirical 
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tion. In discussing significance, we 
must, therefore, clearly distinguish be- 
tween the empirical and theoretical 
roles which the FAP plays. 


Significance of the FAP as an Em- 
pirical Concept 


As a class of response events, FAPs 
appear to be important empirically in 
at least three respects: taxonomically, 
evolutionally, and genetically. 

Consider first their taxonomic value. 
A behavior pattern that possesses the 
phenotypic properties of the FAP 
would seem ideally suited to serve as 
a classificatory device. The particular 
virtues of stereotypy and resistance to 
ontogenetic modification should make 
it possible for the FAP to provide tax- 
onomic information both of a diag- 
nostic and of an associative nature 
(Mayr, 1958). 

Consistent with this claim is the 
fact that the taxonomist no longer re- 
lies exclusively on morphological data ; 
indeed, when Structural and behavioral 
evidence lead to conflicting classifica- 
tory decisions, he is often inclined to 
give greater weight to the latter, And, 
relevantly enough, many of the re- 
Sponses thus employed appear to be 
phenotypically akin to FAPs, if not 
phenotypically identical with them. 
Crane (1941, 1957), for example, 
used certain display movements to 
classify species of fiddler crabs—spe- 
cies that are so structurally alike that 
a hand lens and a trained eye are re. 
quired to differentiate them anatomi- 
cally. Similarly, Hinde (1955), Lorenz 
(1958), and Tinbergen ( 1959) have 
used motor patterns involved in ag- 
gression and courtship to elucidate re- 
lationships among some closely related 
avian taxa, 

Secondly, the FAP can be used as 
an instrument in studying behavioral 
evolution. The morphologist inter- 
ested in Teconstructing phylogenetic 


histories has many available traits 
with which to work—traits that are 
quantitatively delineable, ontogeneti- 
cally stable, and interspecifically dif- 
ferentiable. The student of behavior 
who is devoted to the same pursuit 1s 
far less fortunate, since response traits 
suitable for evolutionary analysis are 
rare. Again the FAP should prove 
valuable. The readiness with which it 
can be discriminated from other com- 
ponents of a species’ repertoire, the 
degree to which it resists ontogenetic 
modification, and the manner in which 
it is distributed among related taxa 
(neither too conservatively nor #00 
divergently) make it ideally suited as 
ап object of phylogenetic study. 
Reports already in the literature 
Support this evaluation, Hinde an 
Tinbergen (1958), and Tinbergen 
(1954), for example, have offered 
stimulating evolutionary analyses of 
certain. display movements in birds 
and Baerends and Baerends-van Roon 
(1950) have been able to carry out 2 
parallel study for cichlid fishes. Simi- 
lar analyses have also been performe 
on many elements of invertebrate be- 
havior that appear empirically akin t? 
FAPs: Crane (1941) on the court 
ship response of fiddler crabs; Evans 
(1953) on the wing movements 9 
wasps; and Blest (1960) on the settling 
behavior of moths, to mention only ? 
few. 
‚ A third empirical consequence arts- 
ing from the discovery of the FAP $$ 
its utility in the study of gene-b€ 
havior relationships. As Fuller ane 
Ompson (1960) have stressed, 5 
ection оғ appropriate рһепоќур! 
measures is one of the most pressi"? 
and Significant Problems for the psy” 
chologist interested in behavior £ 


netics, Here also FAPs, because 9 
their environmental stability, the? 
stereotypy, 


: ? n^ 
their particulate and qu? e 


titatively delineable character, prov’ 


EN. 3 
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highly satisfactory analytic units for 
Psychogenetic research. Several at- 
tempts (eg., Dilger, 1959, 1962; 
Hinde, 1956; Ramsay, 1961) have al- 
ready been made to employ them in 
Just this capacity. 


Significance of the FAP as a Theo- 
retical Concept 


In its role as a theoretical concept, 
What possible contributions does the 

AP make toward increasing our un- 
derstanding of behavior? To answer 
this question, let us accept provision- 
ally the theoretical inferences which 
the instinct theorist has drawn and ex- 
amine some of their implications. 

It will be recalled that FAPs are 
Conceived to constitute the nucleus of 
?n animal’s response repertoire—a nu- 
Cleus that is encoded in the genome 
and that subsequently interlocks with 
” acquired elements of behavior. Al- 

Ough the relationship between these 
nuclear elements and the more com- 
Plex learned patterns of behavior has 
not as yet been elaborated for any spe- 
Clés, the instinct theorist nonetheless 
aS contended that analysis of а reper- 
oire into component elements—an es- 
*cntial first step in understanding the 
ый of any species—must proceed 
vli the genetically given to the indi- 
Pi acquired. It is considered lit- 
i5 J impossible to understand learn- 
us d example, without first under- 
whi ing the innate substructure upon 
jan it is based and by which it 15 1n 
ме affected (Lorenz, 1960; Tin- 

gen & Perdeck, 1951). . 
cal ne implication then of the theoreti- 
"a inferences that have been drawn 
ine the FAP is a policy for conduct- 
a xs esearch: the nuclear elements of 
;, “Sponse repertoire must first be dis- 


ting, 
а Тү. {һеїг properties analyzed, 
Ponents ; influence on all other com- 


S investigated. After comple- 
of these tasks, ontogenetic re- 


search should then be undertaken but 
only in relation to those response sys- 
tems designated as acquired. Obvi- 
ously, there would be no need to study 
the cumulative effects of organismic- 
environmental interactions on innate 
responses; genetically encoded and 
centrally blueprinted, their organiza- 
tion should not be influenced to any sig- 
nificant extent by experiential events. 

Another consequence of the FAP as 
a theoretical concept derives from its 
alleged capacity to provide an index of 
certain neurophysiological events. It 
will be recalled that each FAP is con- 
sidered to be centrally preformed in 
the sense that a genotypically fixed 
isomorphism is assumed to exist be- 
tween its phenotypic properties and 
the anatomical and physiological char- 
acteristics of its neural coordinating 
center. On this basis, instinct theo- 
rists have concluded that it should be 
possible to make relatively detailed in- 
ferences about the nature of such cen- 
ters from observations carried out at 
the molar level (Lorenz, 1960). 

The opportunity to proceed from 
the molar to the molecular in so direct 
and specific a fashion would undoubt- 
edly be of inestimable value in study- 
ing neural processes. A temporally 
integrated response pattern, isomor- 
phic with a localized encapsulated 
brain region, would provide an instru- 
ment, hitherto unavailable, for render- 
ing molar properties into physiological 
mechanisms. Indeed, even if we were 
convinced that the translation would 
not be simple, the very assumption 
of an isomorphic relationship of the 
kind postulated by the instinct theorist 
has profound empirical consequences— 
consequences quite different from those 
which would result if we were to as- 
sume that ЕАР, like most other tem- 
porally integrated responses, are prob- 
ably determined by a complex of func- 
tionally heterogeneous neural processes 
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which occur in anatomically diverse 
brain regions and which do not cor- 
respond either topologically or topo- 
graphically to the behavioral events 
they underlie. Thus, expecting func- 
tional congruency between the molar 
and molecular levels makes us prone 
to homologize all response elements 
having analogous functional character- 
istics. As Lehrman (1953) has pointed 
out, this results in an approach that is 
essentially anticomparative insofar as 
phenotypic resemblances tend to be ab- 
stracted from diverse phyletic levels 
and translated into identical physio- 
logical mechanisms without regard for 
differences in species capacities. 
There is a third theoretical conse- 
quence, still more far reaching in its 
implications for psychology than the 
two we have discussed thus far. If 
the FAP is a temporally integrated be- 
havioral entity which is uniquely or- 
ganized and with characteristic prop- 
erties specifically derived from this 
organization, then laws singularly ap- 
plicable to the FAP would be neces- 
Sary to accommodate the fact of its 
existence. Such a Prescription obvi- 
ously results in a search for novel 
functional relationships, and this is 
what the instinct theorist has evi. 
dently tried to do, On the one hand, 
he has attempted to show that re- 
sponses identified as FA ps cannot be 
subsumed either by the laws of learn- 
ing or by the laws of reflex action, On 
the other hand, he has made an effort 
to study what he considers to be in- 
nately determined perceptory mecha. 
nisms—mechanisms Which, upon being 
activated by specific environmental 
stimuli, presumably function to re- 
lease the FAP (e.g., Lorenz, 1960, 
1961; Tinbergen & Perdeck, 1951). 


Crrricar, EVALUATION or THE FAP 


Critics of instinct theory do not 
deny the reality of the FAP, nor do 


they belittle the importance of study- 
ing the FAP in understanding МП 
brate behavior. The questions which 
have been asked revolve first about 
the theoretical constructs used to ex- 
plain the FAP: do the po eget р" 
erties of the FAP demand the kind 0 
hypothetical neurophysiological events 
deduced by the instinct theorist or we 
these same properties be subsumes 
under what might be called an epi 
genetic approach? A corollary to this 
question is whether these species-typic 
responses require the formulation O 
special developmental laws or can 
they be integrated within a single de- 
velopmental theory that embraces ac- 
quired responses as well? 

We have already noted that the enr 
Pirical properties of the FAP have 
been interpreted to support conclu- 
sions concerning genetic encoding * 
central itemization. At the outset, ! 
must be emphasized that there is no 
evidence, either genetic or embryo" 
logic, which in principle contraindi" 
cates the possible validity of this a 
terpretation. Nor is behavioral : 
dence available which would establis” 
conclusively that responses designate 
as FAPs cannot be innate in the very 
sense in which the instinct theorist h4* 
used that term, It is, however, mean. 
ingful to inquire whether the empiric?” 
features of the FAP can in fact be er 
plained without recourse to propos! 


tions involving the idea of genetic pre 
formism, 


Since most 


subjected to ex 
cannot, 


FAPs have not beet 
perimental study, the 
by and large, be used to pol 
emplify the manner in which onto 
netic processes can function. in Hee a 
mining the structure and organiza 
of Species-type behavior, Our апа 


1 
Sis, therefore, will frequently draw oy 


а : : hic 
research involving responses W the 


crudely analogous tO nit? 
5 à consequence, the evide 
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and arguments presented cannot rule 
out the possibility that the instinct 
theorist may be correct; but it does 
Provide us with a reasonable and per- 
haps more cogent alternative for con- 
ceptualizing the FAP. 


Alternative Routes to Stereotypy 


Let us begin by trying to under- 
Stand how a temporally integrated re- 
Sponse exhibited in a virtually identi- 
cal manner by all members of a taxon 
could arise even if the response itself 
Were not genetically encoded. Of the 
рш of possible alternative sources 
Ог such stereotypy, the most obvious 
165 in the environment itself. 

Involved in the biotic province of 
any species are certain environmental 
Constancies which may function to 
Produce parallel responses in all nor- 
an members (Hebb, 1953; Lehrman, 

53). As Lehrman in particular has 
ee this homogenization can 
Бопе, either as a consequence of such 
con ancies „channeling ontogeny 1n 
н directions so that only cer- 
we evelopmental possibilities are re- 
-Zed or of their providing each indi- 
"rie With identical avenues for re- 
ten expression so that, even when 
са forms of the same response 
ts ie in the repertoire, only one 

qu to be manifest. 
the "d Harker (1953) reports that 
rhythe ly stereotyped diurnal activity 
doss m exhibited by the adult mayfly 
Orgar not occur unless the individual 
east usm has been subjected to at 
Prior Ew 24-hour light-dark cycle 
Period О the termination of the larval 
and ‘ce also Brown, 1959). Thorpe 
ea (1937) showed that the re- 
le Fes of the flour moth by 
Big p ichneumon fly, Nemeritis 
tio, Cens, for the purpose of oviposi- 
nificantly changed in Nem 
On a "spring when they are reared 
егеп host. Apparently, host 


selection by an adult is in part a con- 
sequence of exposure to the specific 
chemical and nutritional milieu which 
Nemeritis experiences while in a pre- 
imaginal stage. And finally, Van der 
Kloot and Williams (1953a, 1953b) 
have pointed to characteristically in- 
variant environmental factors which 
influence the cocoon-construction of 
Cecropia silkworms. Of special inter- 
est here is the fact that, when the 
caterpillar of this lepidopteran species 
is deprived of the usual physical sup- 
port (an upright twig crotch) on 
which to spin its finely tapered cocoon, 
it will use spinning movements which 
are quite atypical, resulting in a cocoon 
markedly distinct from the normal. 

The anatomical and functional prop- 
erties of peripheral structures also 
provide a possible explanation of be- 
havioral stereotypy, for such struc- 
tures can function directly to render 
specific response patterns invariant. 
Perhaps the clearest exemplification 
of the detailed interrelationships which 
can exist between response elements 
and associated peripheral mechanisms 
is provided by the work of Davis 
(1957) on the spotted and brown 
towhees. These birds forage for food 
by using a scratching movement which 
is performed in a highly stereotyped 
but essentially different manner in each 
species. Scratching by the brown tow- 
hee occurs with a backward thrust that 
is noticeably less powerful than that 
manifested by the spotted towhee and 
incorporates a pronounced lateral on 
ponent that its congeneric counterpart 
does not exhibit. Davis has argued 
convincingly that interspecific differ- 
ences in the expression of the va 
ing movement are directly determinec 
by characteristic differences x ha 
osteology and myology of the hind- 
limbs. 

In the present con 
tant to emphasize 


text, it is impor 
that functionally 
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similar stereotyped responses need not 
be governed by the same causal events, 
as illustrated by two instances of in- 
variant behavior occurring at widely 
different levels of the phyletic scale. 
The jellyfish, for example, exhibits 
certain characteristic feeding move- 
ments which appear to be a direct con- 
sequence of the functional properties of 
its medusoid nerve-net system and the 
spatial arrangements of its tentacles 
and manubrium (Maier & Schneirla, 
1935). The great tit (Paris major) 
also exhibits certain species-typic feed- 
ing movements but, in contrast to the 
jellyfish, these movements develop 
over many ontogenetic pathways and 
involve more complex interrelation- 
ships between intrinsic and extrinsic 
factors, 

Thus, as Schneirla has repeatedly 
emphasized (e.g., 1946, 1956), stereo- 
typy can arise from causes which op- 
erate directly and are immediately ap- 
parent or from causes which are con- 
siderably more subtle and involve the 
Operation of disparate ontogenetic 
mechanisms, Our discussion of envi. 
ronmental constancies and structural 
equipment as conceivable 
stereotypy does not mean 
that one or the other of tl 
is involved in all instance 
ioral rigidification or that 
of influence is always the 
only may there be addition; 
origin but it is also imposs 
eralize with Tespect to the 
which any one of them o 
homogenize different respon 
at different phyletic levels. 


Innateness amd the 
tween Learning and 


hese factors 
5 of behay- 
their mode 
Same. Not 
al modes of 
ible to gen- 
manner in 
perates to 
se patterns 


Distinction be- 
Experience 

Let us turn now to consider another 
characteristic of the FAP—independ- 
ence from individual learning—and at- 
tempt first to see what this criterion 
means and then to determine whether 


independence as such necessarily en- 
fails the assumption of innateness. 

As we have already pointed out, the 
instinct theorist seems to conceive of 
learning in a way that is entirely too 
restrictive. When a conventional re- 
inforcing agent does not seem to be 
involved, he is likely to assume that 
learning could not have occurred. In 
view of the controversial status of the 
reinforcement issue in learning theory 
itself, the absence of reward can hardly 
be considered as prima facie evidence 
for the absence of learning. But even 
if we broaden the classificatory cri- 
teria for distinguishing the learned 
from the nonlearned and even if we 
were to grant that these criteria may 
change as our understanding of the 
learning process expands (Beach, 1955; 
Verplanck, 1955), it still seems rea- 
sonable to designate as nonlearne 
those responses whose development 
cannot be explained in terms of any 
extant learning paradigm. Viewed in 
this way, the FAP can be classified— 
at least Provisionally—as а nonlearned 
response, 


The question whi 


ch remains to be 
answered is 


Whether genetic encoding 


actions not presently 
the scope of learnin£ 
the structure and organ 
avior in specific and m 
(Schneirla, 1956, 1957; 
osenblatt, 1961). 
Schneirla has used the term "ex, 
Perience” to denote a wide range 0 
stimulative involvements which deter” 
mine trace effects varying in com 
Plexity from Specific physicochemic? 
changes in Somatic tissue to gener? 
functional integrations in the centr 
nervous system, As he goes on 


included within 
can influence 
zation of beh 
portant Ways 
Schneirla &R 
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point out, the manifestations of such 
effects, in turn, can represent very 
different levels of behavioral organiza- 
ton. Thus, compare host selection in 
Certain arthropods as influenced by 
the composition of the medium on 
Which the larvae are fed (Thorpe & 
Jones, 1937) with the influence of 
carly patterned light stimulation on 
form discrimination and interocular 
transfer in vertebrates (Chow & Nis- 
Sen, 1955; Riesen, 1951, 1958, 1960). 
Again, contrast the reciprocal stimula- 
tive associations ("trophallaxis") ap- 
Parently essential for the organization 
of certain insect colonies (Schneirla, 
1946) with the development of avoid- 
ance responses in mammals as effected 
by isolation from conspecifics (Mel- 
zack & Scott, 1957). 
ach of the cases just cited entails 
Ontogenetic processes which, currently 
in least, аге not describable as learn- 
rie but which nonetheless depend on 
me type of interaction between the 
Organism and its sensory environ- 
ment. То clarify the distinction in- 
volved here, let us consider one exam- 
m In detail Recent studies (Hebb, 
race McBride & Hebb, 1948; Mel- 
sis 1952; Riesen, 1961) of the gene- 
ig emotional behavior in verte- 
Е = indicate that intense fear can 
Gus ted, by strange, although in- 
die visual stimuli. What is nec- 
ар for this development is сот- 
"fiti With a structured sensory en- 
Ver fant commerce in which the 
o У tact of exposure seems sufficient 
i; CStablish the visually familiar. It 
he ent that the animal does not 
miliar d learn to discriminate the fa- 
— rom the unfamiliar nor does it 
ing e undergo avoidance condition- 
Which aire it comes to fear stimuli 
ready iffer markedly from those al- 
tact dno Once early con- 
Prese 1115 the perceptually typical, 
ntation of an incongruous stimu- 


n 


lus combination is all that is necessary 
to evoke an emotional response. 

Admittedly, the term experience is 
difficult to define precisely, but some 
such term is evidently needed to desig- 
nate instances, such as those just cited, 
in which involvement with the onto- 
genetic milieu operates to structure 
behavior through channels which do 
not appear to depend on learning as 
currently conceived. 

We have already mentioned the iso- 
lation technique as a means for assess- 
ing the influence of learning. What 
about its value in assessing the influ- 
ence of experience? There is no 
doubt that, if proper control condi- 
tions are employed, the isolation tech- 
nique can provide evidence as to the 
importance of social interaction as a 
developmental variable. When Cullen 
(1960), for example, showed that 
male sticklebacks reared in isolation 
performed the typical zigzag “court- 
ing" dance on the first occasion that 
they were presented with a model of 
a gravid female, she thereby demon- 
strated that, whatever else is neces- 
sary for the ontogenesis of the dance, 
experience with conspecifics is not. 
But to say that the zigzag dance does 
not develop out of the interaction with 
species members ds obviously not 
equivalent to asserting that the “whole 
*blueprint' of а specifically structured 
receptor and effector apparatus is con- 
tained in the genoma [Lorenz, 1961, 
p. 183].” This unwarranted deduc- 
tive leap—one which the instinct theo- 
often makes on the basis of an 
isolation experiment—results from the 
failure to realize that an animal re- 
moved from the company of species 
hers is not necessarily removed 
from other nonsocial environmental ef- 
fects which in themselves might be 
sufficient for the organization of the 


observed behavior pattern. . 
Indeed, with respect to the genesis 


rist 


meml 
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of certain response patterns, the isola- 
tion and normal situations may not be 
significantly different in that they may 
both provide conditions guaranteeing 
the occurrence of exactly the same 
critical experiential involvements. As 
Lehrman (1953) has put it, “The im- 
portant question is not ‘Is the animal 
isolated?’ but From what is the ani- 
mal isolated? [p. 343].” For example, 
there is compelling evidence (Birch, 
1956; Schneirla & Rosenblatt, 1960; 
Steinberg & Bindra, 1962) that, in the 
rat and cat, certain self-stimulative 
Processes, particularly those involved 
in licking the posterior parts of the 
body and ingesting vaginal fluids, are 
implicated in the development of ma- 
ternal behavior. Separation from con- 
specifics could hardly be expected to 
inhibit the occurrence of Such proc- 
esses any more than the presence of 
conspecifics would be expected to fa- 
cilitate their expression. 

We do not wish to impugn the value 
of the isolation technique nor to deny 
the importance of determining whether 
ОГ not social interactions is critical for 
the organization of 
Sponses. We merely 
the obvious fact that 


"not be consid- 
ion to the ques- 
rticular response 
encoded or ex. 


Independence 


from. Immediate Affer- 
ent Control a; 


d Neurogenesis 

It will be recalled that freedom of 
the FAP from immediate afferent con- 
trol has been adduced to support two 
theoretical Propositions: that the com- 
ponent response elements of the FAP 
are centrally coordinated by virtue of 


genetically organized neural circuits, 
and that the threshold fluctuations 
characteristic of the FAP arise solely 
from the accumulation and discharge 
of endogenously generated motiva- 
tional impulses. 

The problem of how the € 
System becomes structured to suppor 
the orderly relations among the con- 
Stituent parts of a movement pattern 
is not to be confused with the problem 
of how central coordinative mecha- 
nisms, whether maturationally organ- 
ized or experientially established, par- 
ticipate in energizing such patterns. 
The two problems are obviously dis- 
tinct and warrant separate considera- 
tion. We shall treat the first of these 
in the present section. 

To begin with, it should be е 
sized that no contemporary student O 
behavior would maintain (contrary tO 
the contention of Lorenz, 1960) that 
the embryonic growth process simply 
leaves an equipotential homogeneous 
network out of which every neurona 
linkage must subsequently be forge 
by training or experience. At the very 
least, all would agree that the basic 
architecture of the nervo 
determined by intrinsic 
forces—in other words, 
tion of nuclear centers 
graphical arrangement 
become established 
in species- 
1958), 
the role i 


us system 15 
development? 
that the loca 
and the top” 
of fiber tracts 
during embryogeny 
typic fashion (Sperry, 195 : 
Ut for the instinct theori5 
nherited structures play P 
the organization of the FAP is highly 
Specific—there are coordinating struc 
tures in the brain and spinal cord i 
taining genetically patterned een 
Which determine the FAP down to ! - 
smallest detail, For the critics, in co? 
trast, the intrinsic contribution is ve" 
much less determinate and, with t 
Possible exception of mechanisms Ч? 
derlying certain simple reflex reactio?^' 
does not include neuronal anlagen 5" 
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ficient in themselves to provide for the 
expression of any temporally inte- 
grated response. Rather, the growth 
Process, in establishing the general 
structural and functional plan of the 
nervous system, is conceived to pro- 
vide a substrate against which neural 
configurations subserving coordinated 
behavior gradually become refined and 
claborated through environmental in- 
rapidis a substrate, in other words, 
к essentially determines the direc- 
lons and limits of a response reper- 
foire but not its details. 
v this distinction in mind, we 
hes dae turn to the theoretical issue 
from? by evidence of independence 
immediate afferent control. When 
| response is released but not guided 
et se stimuli, it is reasonable to 
ма that an organized central sys- 
has been activated and that the 
eon of this system influence the 
tia character of the observed 
jaan | It is quite likely that such 
ystems underlie the FAP, for, in ad- 
oat to the presumptive evidence 
rn ied by field observations, there 
experimental results which sup- 
(19 the same conclusion. Thus Hess 
ong! and Hess and Brugger (1943) 
ia DRIN that, in the cat, 
ie osi electrical stimulation of di- 
es a loci can evoke discrete be- 
eatin ге items normally involved in 
cen КЫ ‚ Sleeping, and fighting. More re- 
1962)" von Holst and St. Paul (1960, 
broad have been able to produce a 
atin Tange of movements by stimu- 
the ae variety of localized points m 
hiven stem of domestic fowl. The 
and не elicited were species-typic 
exhibited 1 ally identical with those 
Pecking in caring for the young, 1m 
Tho and in flight from enemies. 
the е theorist considers data 
Mas just cited to indicate the 
Mined o of endogeneously predeter- 
neuronal systems sufficiently 


organized to integrate the component 
elements of the FAP into an orderly 
spatial and temporal amalgam. How- 
ever, no really critical evidence has as 
yet been adduced in support of this 
possibility. 

First, the fact that punctate stimu- 
lation can effect the discharge of an 
integrated response is in itself not 
decisive with respect to determining 
the nature of antecedent neurogenic 
events. Miller (1957, 1958, 1960), 
for example, was able to elicit a previ- 
ously learned bar-pressing response in 
satiated rats by stimulating ventro- 
medial nuclei in the hypothalamus. In 
this case, it is obvious that extrinsic 
induction rather than intrinsic self- 
differentiation was the primary organ- 
izing agent underlying the observed 
behavior. Secondly, the research we 
have cited involving the “release” of 
responses which have at least some of 
the phenotypic properties of the FAP, 
has employed only adult animals and 
consequently cannot rule out the pos- 
sibility that the underlying neuronal 
configurations, instead of being ge- 
netically blueprinted, required experi- 
entially instigated refinement before 
achieving the degree of functional spe- 
cificity necessary to support response 
expression. 

Finally, as both Gloor (1954) and 
Lehrman (1953, 1956) have empha- 
sized, the instinct theorist has tended 
to overinterpret the work of Hess and 
his associates. The diencephalic areas 
they stimulated did not exhibit the 
characteristics of self-contained еп- 
capsulated units awaiting only ade- 
quate releasing stimuli before dis- 
charging their responses. Rather, the 
effects of stimulation depended partly 
on the nature of concurrent afferent 
inflow, the form of the response as 
well as its latency of release being 
modified accordingly. This, of course, 
does not corroborate the view that 
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autogenous differentiation is sufficient 
to provide for the coordination of the 
FAP. Indeed, whatever the extent of 
their genetically induced organization, 
it is possible that these areas must be 
continuously directed by information 
coming from cortical and somatic 
sources in order to support temporally 
integrated behavior. Whether the 
same interpretation can also be of- 
fered in relation to the work of von 


Holst and St. Paul remains to be de- 
termined. 


Response Elicitability and Action-Spe- 
cific Energy 


That a temporally integrated behay- 
ioral pattern like the FAP should 
seem to exhibit an activity-specific 
rhythm of exhaustion and recovery 
has posed challenging problems for all 
students of behavior. As already in- 
dicated, this rhythmicity has led the 
instinct theorist to prop 


pose a reservoir 
model of motivation and, in addition, 
to conjecture that the underlying proc- 


ess must be central in origin and neu- 
rologically akin to the evidently endog- 
enous activity shown by the cardiac 
ganglion of the lobster and the brain 
stem of the goldfish, 

ne can never be certain, however, 
even when working with isolated 
nerve-cell Preparations, that the enyi- 
ronment is not partly responsible for 
the pattern of excitation that is pres- 
ent (Adrian, 1950). All that one can 
be sure of is that a particular neura] 
rhythm is not the result of a corre. 
sponding afferent rhythm. However, 
there is a great deal of evidence which, 
although | not absolutely conclusive, 
nonetheless makes it reasonable to 
infer that the invertebrate as well as 
the vertebrate nervous System is con- 
tinuously active, generating patterneq 
impulse-sequences that arise neither 
from phasic nor from nonphasic sen- 
sory input (Bullock, 1961; Prechtl, 


1956). But to grant the validity of 
this inference is not tantamount to 
consenting to the instinct theorist's at- 
tempt to relate such endogenously de- 
termined impulse sequences to fluctua- 
tions in the elicitability of the FAP. 
There are simply no neurological data 
which convincingly support the idea 
that the motivational dynamics of the 
FAP, or of any other temporally inte- 
grated movement pattern for that mat- 
ier, are controlled and regulated by 
the automatic production of response- 
specific impulses. ich 
To illustrate the inductive gap whic 
in fact exists here, consider the type 
of evidence to which the instinct theo- 
rist has repeatedly appealed. was 
(1941) implanted a piece of We 
Cord and a limb bud onto the back a 
an amblystoma embryo and found tha 
movement occurred as soon as efferent 
fibers made contact with the limb de- 
spite the fact that afferent connections 
had not yet been established. Al 
though this result does provide dra- 
matic evidence of neural automaticity: 
it certainly does not show that thé 
Source of energy for a response pat- 
tern as complex as the FAP E- 
neurologically akin to the innervatio? 
of a limb bud nor even that coordi- 
nated locomotion can arise indepen 
ently of sensory input. In relation b 
the question of Coordination, it shou! 
be emphasized that the movement? 
eiss observed Were only spasmoe” 
or Paroxysmal, hence, differing mar 


edly from the smooth ambulatory 
rhythm normally evinced by the ad! 
amblystoma 


tern of the 
cation, for а 


(also see Gray, 1950), er^ 
Course, the status of the re$ 
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voir model of motivation does not rest 
solely on the neurological speculations 
advanced by the instinct theorist. The 
behavioral implications of the model 
are considerably more important and 
it is upon them that a definitive evalua- 
tion must be based. 
_ The reservoir model makes reduc- 
tion in elicitability contingent upon re- 
Sponse expression since it holds that 
the discharge of action-specific energy 
can occur only through the perform- 
ance or execution of the response. On 
this basis, it seems appropriate to ask 
whether reduction in elicitability nec- 
essarily requires response performance 
and, correlatively, whether a behav- 
‘oral model involving the accumula- 
tion and release of action specific en- 
ergy is generally applicable. 
, Precht! (1953) found that the gap- 
ing response of young chaffinches can 
be elicited by visual, acoustic, and vi- 
matey stimuli. He then discovered 
at the exhaustion of gaping by re- 
лец presentation of опе stimulus 
left the nestling relatively unaffected 
in its readiness to gape to the other 
ы, Hinde (1954, 1960a, 1960b) 
m also reported similar results in re- 
а. to other responses having the 
aor characteristics of the FAP, 
has emphasized that, for such re- 
пов at least, stimulus-specific satia- 
ma rather than the discharge of re- 
i Nse-specific energy appears to be 
е mechanism effecting reduction 1n 
elicitability, 
"hs reservoir model must also be 
^ " ered inadequate because it fails 
time oe the fact that FAP avail- 
© ы сап сһапре аз а function of 
Which types of afferent relationships 
are perhaps even more subtle. 
fon event case here is the incuba- 
behavior of the black-headed gull 
* por 1961, 1962; Moynihan, 1953). 
i ie tling on a clutch of eggs, these 
perform a sequence of move- 


ments which include lowering the 
chest, waggling the tail from side to 
side, and rapidly quivering the body. 
Of particular significance as Beer has 
pointed out, is the fact that the fre- 
quency with which this sequence oc- 
curs is related not to energy consump- 
tion, but in part at least, to thermal 
effects consequent upon settling. More 
specifically, if the pattern of sensory 
stimulation received from the eggs 
fails to satisfy the temperature re- 
quirements of the brood patch, and 
probably the requirements of other 
structures as well, then the bird will 
begin again and repeat the entire set- 
tling sequence. In fact, it can be made 
to do this many times simply by de- 
creasing the normal clutch number or 
by causing the eggs to become disar- 
ranged. 

In addition to stimulus effects of 
the kind we have just discussed, brief 
mention might also be made of the 
role played by the absolute level of af- 
ferent input. Hebb (1955) has drawn 
attention to the significance of the 
brain-stem reticular system, propos- 
ing that its innervation by peripheral 
inflow induces a general arousal state 
which influences all behavior regard- 
less of the specific control mechanisms 
involved. The reticular formation, 
functioning in the capacity of a non- 
specific activator, could then produce 
widespread threshold changes. If this 
in fact proves to be so in relation to 
the FAP, then the reservoir model 
would have to accommodate not only 
the influence of specific stimulus fac- 
tors but the influence of general ones 
as well. А 

Tt seems clear from the evidence 
cited thus far that reduction in the 
elicitability of an FAP is not neces- 
sarily contingent on its performance. 
On this basis, it is reasonable to con- 
clude that the reservoir model, involv- 
ing as it does the concept of response- 
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effected energy discharge, does not 
offer a framework comprehensive 
enough to integrate all instances of 
threshold change in the FAP. 


Aw EPIGENETIC APPROACH 
TO THE FAP 


The above analysis indicates that 
the theoretical position of the instinct 
theorist is not in fact demanded by the 
empirical properties of the FAP. In- 
deed, we are able to show that each of 
these properties can reasonably be in- 
terpreted in a manner broadly con- 
sonant with what may be called an 
epigenetic approach. In conclusion it 
would be well to attempt a general 
characterization of the major features 
of this approach. 

An epigenetic approach holds that 
all response systems are synthesized 
during ontogeny and that this synthe- 
sis involves the integrative influence 
of both intraorganic processes and ex- 
trinsic stimulative conditions. It con- 
siders gene effects to be contingent on 
environmental conditions and regards 
the genotype as capable of entering 
into different classes of relationships 
depending on the prevailing environ- 
mental context, In the epigeneticist's 
view, the environment is not benignly 
Supportive, but actively implicated in 
determining the Very structure and or- 
ganization of each response system. 

Contrast this view with the template 
conception of the instinct theorist 
which maintains that FAPs result 
from the virtually passive translation 
of genetic factors into phenotypic 
traits through the medium of tissue 
growth and tissue differentiation, Each 
FAP, in other words, is represented 
in the genome by functionally congru- 
ent genic agents which become trans- 
muted into underlying neural coordi- 
nating centers as long as conditions 
consonant with viability prevail. 

According to an epigenetic concep- 


tion of development, however, there is 
no adaptively organized response— 
species-typic or otherwise—that ex- 
hibits the genetic parallelism of which 
the instinct theorist speaks. Rather, 
the behavioral significance of genetic 
entities, and consequently of organic 
traits, is conceived to derive, in part 
as least, from the particular environ- 
mental context with which they be- 
come intermeshed during ontogeny. It 
is this coalescence of the endogenous 
and the extrinsic which, in an epige- 
netic view, makes each integrated re- 
sponse an emergent and which thus 
renders it gravely misleading to con- 
ceptualize maturational elements as 
functioning isomorphically in behav- 
ioral development. 
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Factor analyses of verbal personality measures have typically gener- 


ated highly complex multidimensional structural systems. 


Available 


evidence now suggests that the most dependable dimensions drawn 


from conventional factor analyses of ratin 
simple, familiar dimensions of broad seman: 
that most of the initially obscure, apparentl 
rowly defined factors many investigators c 
either trivial, artifactual, capricious, or all 
personality were reduced to 2 factors, 
to 2 ratings, 1 concerning perceived 


lated to introversion-extraversion 


lidities for the 2 simple ratings ar 


those for any of the more cumbersi 
and rating schedules examined in t 


ory and method were discussed. 


Of the several decisions required in 
the conduct of factor analysis, none 
has greater bearing on results than the 
number of factors to keep for rotation. 
When only a few "strong" factors are 
retained, the dimensions which emerge 
are broad in scope. Each is defined 
by a large set of variables, which may 
appear semantically heterogeneous. 
When many factors are retained, the 
dimensions which emerge are nar- 
Tow in scope. Each is defined by a 
relatively small number of variables. 
These may also be semantically het- 
erogeneous. When a few factors of 
high variance are rotated against a 
larger number of factors of low vari- 
ance, the former are rather narrowly 
fractionated and often lose their origi- 
nal identity. But if the many narrow 
factors are obliquely determined, and 
a second-order analysis is conducted, 
the original massive factors are essen- 


1An earlier version of this paper was 
read at meetings of the Society for Re- 
search in Child Development, Berkeley, 
California, April 1963. 
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gs and questionnaires are 
tic scope. It also appears 
y more precise, more nar- 
laim to have revealed are 
3. Verbal descriptions of 
and the 2 factors were reduced 
adjustment and the other re- 


Convergent and discriminant va- 
€ evidently equal or superior to 


ome and expensive questionnaires 
he review. 


Implications for the- 


tially reconstituted in the resulting 
second-order dimensions, 

Insofar as the smaller factors are 
more than spurious, we are dealing 
with an issue of factor “order” or de- 
gree of complexity, and mathemati- 
cally there is neither need nor method 
to make a choice. The factors simply 
vary in complexity. In general, any 
narrow factors can be merged into 
broader factors and any broad factors 
can be separated into narrower fac- 
tors. With appropriate manipulation 
of measures, this can be continued in- 
definitely in either direction, though 
within the limits of a given matrix, 
the process has to stop when a single 
general factor emerges at the top of 
the hierarchy, and instrumental spe- 
cifics appear at the bottom (Burt, 
1950; Humphreys, 1962; Vernon, 
1950). 

In the pursuit of research and in 
the construction of theory, however; 
choice of level is often desirable. The 
usual criteria for factor inclusion lead 
to the retention of large numbers 0 


—— — 
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factors and a complex structural sys- 
tem. It is the purpose of this article 
to suggest that better description can 
oe be attained with a very few 
ie - measured very simply. For 
к бс being, conclusions are те- 
Bk of to conventional factor analy- 
ters nun personality descriptions, 
ides *j goals are to define dimen- 
кен Е. personality, the data are 
Haines rom ratings and question- 
ABE "in the analyses are conducted 
dipl he bounds of a communality, 
mate cm model. The com- 
sheet ns follow may raise questions 
ins other domains, other media, and 
the models, but without further data 
y can do no more than that. 


Descriptive EFFICIENCY 


ao analysis was designed to 
di he parsimony in the description 
dices D phenomena. Descriptive 
бетп »4 15 therefore of primary con- 
а evaluating the utility of а 
iral system. 
us p ts and theoretical articles, vari- 
ordinar ^ for succeeding factors are 
мек dd represented by à smoothly 
tion ae gradually descending func- 
ine in н sort shown by the broken 
the fog, ar 1. If this were actually 
ecisions of the curve, nonarbitrary 
have “eg os factor inclusion would 
Strains based on some other con- 
ance oe than the amounts of vari- 
tors, поса by successive fac- 
ion id this is not the way the func- 
empirical 2 looks. In a number of 
le fme investigations, it has taken 
Fieure | shown by the solid line im 
Corists ; ‚ While Cattell and other 
lor to tal might expect the second fac- 
аз the ^" up almost as much variance 
Counte Pe the second factor actually 
агіапсе or less than half as much 
rise i as the first. There is a sharp 
| acceleration after the first two 
Tee factors and very low slope 


— — — Assumed variance 
function (from 
Cattell, 1958) 


Actual variance 
function (from 


X Peterson, 1960) 


MAGNITUDE OF VARIANCE 


о 4 8 12 16 20 
SUCCESSIVE FACTORS 


Fic. 1. Assumed and actual variance 
functions. 


after that. Under such conditions, it 
is inefficient to retain any factors be- 
yond the first few. 

]f one goes beyond that, it is diffi- 
cult to decide where to stop. The ad- 
dition of new factors and the division 
of old ones seems to be endless, and I 
believe we are already near the point 
of utter descriptive chaos in the pro- 
liferation of personality dimensions. 
In the field of human abilities, Guil- 
ford now has more factors than Thur- 
stone had tests (Humphreys, 1962). 
The situation in the personality do- 
main is even more terrifying. Cattell 
(1957a) alone has defined at least 45 
dimensions. Those are the ones he 
mentioned in his 1957 book and he 
left some out. Extracting large num- 
bers of factors in the face of facts as 
shown in Figure 1 defeats the pur- 
pose of factor analysis. Tf an investi- 
cator wants to deal with factors at all, 
he had better deal with factors which 
are reasonably efficient. If there is 
need to deal with more specific char- 
acteristics, factorization offers no ad- 
vantage. The number of dimensions 
one must eventually consider will be 
enormously large. Researchers might 
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as well examine the original elemental 
variables, and not bother with factor 
analysis at all. 


SUFFICIENCY IN FACTOR EXTRACTION 


Not only is the retention of many 
weak factors inefficient, at least for 
some kinds of data the smaller dimen- 
sions appear to be purely artifactual. 
Linn (1964) introduced 20 random 
variables among 36 other “true” vari- 
ables which had previously been ex- 
amined in a study of rated personality 
characteristics (Peterson & Cattell, 
1959). Results are shown in Table 1. 
The distributions of loadings for ran- 
dom variables become indistinguish- 
able from those for “real” variables 


when the number of factors exceeds 
two or three. Mean values derive 
from Table 1 show that the mean ab- 
solute loading for random variables 
actually exceeds that for empirica 
ratings for the third factor, and the 
mean loadings are approximately the 
same for random as for "real" vari- 
ables for all factors beyond the third: 
Retention of 13 factors, as was dont 


in the original research, is obviously 
out of the question, 


FACTOR INVARIANCE 


By “invariance” is meant a condi- 
tion of factor comparability such that 
factors in two or more studies remai? 
identifiably the same when the con” 
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ens [оси 15 unchanged ; the 
el ules for accepting representa- 
пе "a within that domain are 
empi m , but the actual measures 
and us | are altered in some way, 
nytitian aS subject sample is changed. 
to ано in this sense, is important 
parent = any factor analysis, and is 
facto arly important in the use of 
ger r analysis as a means of defining 
neral traits. 
тысы replication, the invariance of 
на an pE examined in a number 
ivoa a e most common procedure 
intuition combination of inspection, 
Vestigata, and preconception. The in- 
salient = gazes thoughtfully at the 
опе hb. ip les, and decides whether 
as iig appears" to be the same 
cording] r and should be labeled ac- 
Plicit, оз 4 due criteria are seldom ex- 
unknown, weights and interactions are 
femme objectity: and standardize 
€ similari ч аана SMS pre 
(Cattell ity have taken several forms 
1960; fy 1955; Cattell & Baggaley, 
Tucker ae 1960, pp. 256-259; 
low were d ). The data shown be- 
Columns of te by intercorrelating 
mon mark actor loadings over com- 
the y ker variables. In relation to 
5 Sual sort of correlation, the fac- 
„are analo siu] 
adings on gous to variables, and 
analogous common marker items are 
Jects, rc i Scores obtained by sub- 
9ne Qu the columns of loadings in 
olumns E are correlated with all the 
angular ne in another, a rec- 
the args can be formed, with 
OWS, and us one study heading the 
the actors in the other heading 
. If the factors have been 
identified, by inspection, 
; intuition, or otherwise, it 
M Studies 9 place the factors in the 
" lleged] in parallel order. Then, 
Me or matching factors are the 
Similar in fact, high 7’s should 


two °ssible 


appear in the diagonal, and low values 
everywhere else. 

Over the past several years, with 
the help of several colleagues and a 
very effective computer system, the 
author has evaluated the invariance of 
both broad and narrow factors from 
ratings of personality in children, rat- 
ings of clinical problems in children, 
ratings of parental attitudes, and ques- 
tionnaire responses which discriminate 
between delinquent and nondelinquent 
adolescents. In every case, consider- 
ing each set of factors as a whole, 
the broad group factors have shown 
greater stability from one study to 
another. The following example is 
chosen principally on the basis of ade- 
quacy of both studies, freedom from 
contamination of results, and only in- 
cidentally because the findings are 
slightly more favorable to the author’s 
case than some of the other compari- 
sons which might have been chosen. 

Some time ago; Cattell (1947) ob- 
tained ratings of young adults, ana- 
lyzed the data, and obtained 11 fac- 
tors which he identified and named. 

ous ratings of 7-year-old 


Later, analog r ar- 
children were obtained, a similar 
analysis was performed, and 13 fac- 


tors were identified. For the most 
part, these “appeared” to resemble the 
factors for adults, and the authors con- 
cluded that most of the factors previ- 
ously isolated in older subjects were 
found for the younger ones (Cattell & 


Сиз, © 2; ion of loadings, how- 


duced the matrix which ap- 
The correlations 1n 


asserting 
But the 7’s for al- 
ors are not im- 
for nonmatch- 
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TABLE 2 
{ " FACT ‘GS F ADULTS 
ELA S ETWEEN "NARROW" FACTOR LOADINGS FOR 
eae uc FOR 7-YEAR-OLD CHILDREN €— 
Adults 
Р А Е F © н 1 1 І м B » 
23 
3 5 8 51 1б хе 
.06 32 03 332 .20 .05 18 01 
- p: 3 | 38 | 21 | .29 | a0 | иу 07 | 24 i =. 
E32: | 25 | 9| о | 32 | 762 | д 02 | 28] 08 | of 
G 35 32 19 61 24 -04 .28 24 20 37 36 
H 01 .07 .29 25 -70 .08 18 14 04 Eu ET 
S I 23 .07 .06 32 16 41 48 15 01 a 25 
узе у | рш 01 |.22 | o1 |06 |37 | 17 | 07 10 | 25 
melli 41 148 | a8 | a9 | 37 | мо | оз LIE 
ams M 14 01 .28 21 22 07 .01 23 14 2 03 
€ „45. 15 11 18 ald .02 13 AT 3 05 17 
р 11 11 5 11 .04 35 09 .08 22 `$ 16 
L^ -09 09 AS 10 17 13 .01 .49 16 Ur 40 
26 27 449 27 00 10 E 19 33 E por 


well matched with another from which 
it is presumably distinct as with the 
one it is said to resemble, 

When the two high order dimen- 
sions, appearing to reflect “adjust- 
ment" and "introversion-extraver- 
Sion" were examined in the 
the figures in 


Correlations, for 
are low, 

Data from other con 
Summarized in Table 4. 
the values for matchi 
means of diagonal уа 


прагіѕопѕ are 
In this table, 
ng factors are the 
lues from matrices 


TABLE 3 

CORRELATIONS BETWEEN 
LOADINGS ров AD 
7-YEAR-OLD 


“Broan” FACTOR 
ULTS AND FOR 
CHILDREN 


7-year-olds 


ma e 


Adjustment |Extraversion. 
Justment introversion 


Adjustment 92 27 
Adults 
Extraversion- 
introversion 10 83 


of the kind shown in Tables 2 and : 
The values for nonmatching eoa 
are the means of all other carseat 
in each matrix, taken without rega 
to sign. 1 
The Correlations in the first ey 09 
of Table 4, for similar "strong re 
tors, tend to fall in the 80s, and a 
higher in each case than the ee 
for factors alleged to be different. 150 
figures in the first column are не 7 
higher than those in the third ed 
umn, which contains similarity inden 
for narrow factors which theoretict о 
ought to match, In fact the o 
values are not much higher than t an 
in the fourth column, for the Miis 
factors which are hypothetically 
tinct from one another. t 
Factor invariance in studies of 
Sort may arise from covariance. 
observed behavioral tendencies, "du 
ordinarily assumed by trait theor the 
r it may have more to do with as 
perceptual tendencies of observers» 


his 
in 


4 ч : ar 
will be proposed later in this es 
cle. €monstrations of invariant al 
nothin 


S to this point. 


г 
But struct, 
replicability is a high 


1 
desideratu”™ 
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TABLE 4 
AVERAGE FACTOR LOADING CORRELATIONS FOR BROAD AND NARROW FACTORS 
Broad factors Narrow factors 
Mean r for Mean r for Mean r for Mean r for 
matching | nonmatching | matching | nonmatching 
factors factors factors factors 
Personality factors 
Adults (Cattell, 1947) 
versus 11-year-olds 89 26 = н 
(Cattell & Gruen, 1953) 
Adults (Cattell, 1947) 
versus 7-year-olds B2 - - ч 
(Cattell & Coan, 1957) 
Adults (Cattell, 1947) 
versus 4-year-olds o - 2d ш 
(Peterson & Cattell, 1959) 
Parental attitudes 
merican mothers 
Versus Sicilian mothers dá а E Е 
Children’s problems (Peterson, 1961) 
ifth and sixth grades 555 22 
versus third and fourth grades i82 = = Е 
Fifth and six 
a xth grades 51> 
versus first and second grades St = " m 
Fifth and si 
S nd sixth grades 84 41 52 26 
rsus kindergarten р 


^ Thi i ч "—— 
known value involves maximal capitalization on chance. 


b 


facto Narrow factors in these three groups were defined in 


it | 
self, and the evidence clearly favors 
o ew strong factors over many weak 
nes in this regard. 


VALIDITY оғ VERBAL PERSONALITY 
MEASURES 


in it questionnaires and ratings used 
mer. DU measurement are indeed 
volved Е personality, the traits in- 
Stanc Should manifest some con- 
o ed Over methods, over situations, 
Case p extent over time, and in the 
Preeiapl ratings should display ap- 
i entit € stability with shifts in the 

Y of raters, If conditions of mini- 


T5 in the other comparisons of narrow factors. This probably exp! 


Matching was done after the loading correlations were 


matrices containing only 5 factors, instead of the 10-15 
lains the improvement in results. 


mal validity are to be met, the various 
intercorrelations between measures of 
a given trait ought to be higher than 
those between different traits as as- 
sessed by separate measures (Camp- 
bell & Fiske, 1959). It is also desir- 
able, if not absolutely essential, for the 
ween measures of a 


correlations bet of 
given trait to exceed those within 
methods and situations (Humphreys, 


960). 
1 It jt here, as Becker ДО) has 

‘ated out, that the multi actor meas- 
м of Cattell have failed most badly. 
In one of the studies Becker cites as 
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TABLE 5 


RELATIONSHIPS BETWEEN Q Factors AND PARENT RATINGS OF PARALLEL FACTORS 
Y Q Facrons (Self-Ratings) 


Feet! Ас | || 
ratings 
04 |—.14| .06| .02| .12| .22|—.07 
$ ~03| 03 -06| .08|—.05| :00|—.07 
[e 45| .22| .27|—23| .09| 113] 01|—02 
E 14| .08|—.06| .37| .10|—21| 42] —‘o4 
F 332| .09|—.07| .22| :26| .05| ;36| | 
G Aa) 47| 15|—29| .07| .18|—:02 | —)05 
H 126) .26| .09| .14| .18|—15| 51—15 
I -06 |—.06 | —.15| .27| .01|—207| 22|—7 
L —28| .09} .06| .15|—24| .o1| 01|—06 
M |—07)—.05/—09] :08|—31| ‘01 |} —‘os 
N 09 :..06| .28|—.02| .01|—:07| 10|: 
О | -901—48 —19| .02|—:02| 01| :07 1:08 
Qı |-12| .22| 102] 134) :03|—12| 05] : 
O0: |—10| :00| 04| :02|—106 —.19 |—.11| —.11 
Qi 30| 11] 25|—19| п4| <14 |212 | —:08 
Qe 1-14 |—17|—17| 25|—22| :08|—:02 | 25 


most favorable to Cattell's position 
(Meeland, 1952), the average correla- 
tion for “matched” behavior rating 
and questionnaire factors is .37, and 
the average for “nonmatching” factors 
is .32. Becker found other studies 
even less convincing, and concluded 
that claims of correspondence between 
questionnaire and rating factors were 
poorly supported by available evidence, 
Nothing in Cattell’s (1961) reply 
changed any of the facts. 

Whether higher order factors will 
offer any improvement in validity is 
not well determined at this time. The 
writer knows of only one study in 
which factors at both levels were ex- 
amined for factor Correspondence over 


TABLE 6 


RELATIONSHIPS BETWEEN QUESTIONNAIRE 
FACTORS AND RATINGS OF 
PARALLEL FACTORS 


Questionnaire f. actors 
(self-ratings) 
ена ИЩ 


Dynamic Introversion- 
integration extraversion 


Dynamic | 
integration 28 05 
Parent 
ratings Py 
Introversion- 


extraversion 10 38 


media. Wetzel (1963) administered 
the 16 P-F Test (Cattell, 1957b) to а 
class of undergraduate psychology stu- 
dents, and obtained peer and parent 
ratings on the dimensions the ques- 
tionnaire is supposed to measure. In- 
terrelationships were established for 
72 subjects. Parents ratings were 
made by the mothers for all cases 
where mother ratings were available 
(N=61). The remaining 11 parent 
ratings were made by fathers. Peet 
ratings were obtained from other stu- 
dents, elected by the subject’s them- 
selves as “people who knew them well 
in the university setting, and could 
give Teasonably accurate evaluations. 
Most of the subjects asked their room- 
mates to rate them, 

Illustrative findings for narrow fac- 
fors are shown in Table 5, The va- 
lidity correlations in the diagonal аге 
not much different from the remain” 
ing correlations in the matrix. 

Values for two second order factors: 

ynamic Integration (self-perceptio® 
of adjustment) and Іпігоуегѕіоп-Ех” 
traversion, are Shown in Table 6. I” 
this case, the correlations in the di 
agonal for the “same” factors are at 
least Statistically significant, and the 
off-diagonal values are not, 
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TABLE 7 


AVERAGE VALIDITY CORRELATIONS FOR BROAD AND NARROW FACTORS 


Broad factors Narrow factors 
Mean r for Mean r for Mean r for Mean r for 
“same” “different” “same” “different” 
factors factors factors factors 
Q factors versus 
peer ratings 3l 49 22 18 
О factors versus 
parent ratings 83 .08 17 12 
P E" 
eer ratings versus 
parent ratings 25 04 AT 12 


ae from other comparisons in 
he | у аге summarized in Table 7. 
s [Tall comparisons among the origi- 
нај muültitudinous primary factors, ше 
wh. " for matching" factors is .19, 
iig. is not reliably different from 
ee or an N of 72. The mean 7 for 
bien factors is .12, which is 
же] than the mean correlation for 
not etically related dimensions, but 
ee The mean r for matching 
E order factors is .30, which е 
ку the usual limit of statistical sig- 
areena, while that for hypothetically 
= ated factors is ло. The essential 
one = dimensional identity pro- 
theref y Campbell and Fiske are 
тыге poorly met by the original 
der fac factors. The broad second or- 
actors look somewhat better. 


Economy AND SIMPLICITY IN THE 
SSESSMENT OF VERBAL BEHAVIOR 
Pr me now press the reductionism 
ther os argument one step fur- 
ге ith considerable dependability, 
and вы factors of the sort Cattell 
order Fu postulate merge into second- 
айаны a which possess certain 
the Ъ ges over the primary factors. 
терг road group factors appear to 

esent rather diffuse concepts of 


self and others. Perhaps it is possible 
to specify these concepts within closer 
topical limits, and reduce some of the 
most important aspects of verbal per- 
sonality assessment to a couple of 
questions. As to theory, it is proposed 
that when people consider their own 
“personalities” or those of others, their 
statements revolve largely about such 
concepts as adjustment and extraver- 
sion-introversion. As to procedure, it 
is proposed that a good deal of the 
information people convey in verbal 
statements about themselves or others 
can be gained by asking them (a) 
*How well adjusted is X?" (where X 
may be the self or another person), 
and (b) "Where does X stand in re- 
gard to introversion-extraversion ?" 
Wetzel (1963) also asked all re- 
spondents, the core subjects them- 
selves as well as peers and parents, to 
rovide ratings on “adjustment” and 
k extraversion-introversion,” each on a 
simple 7-point rating scale. These 
were correlated with the second-order 
factor scores which had been pains- 
takingly calculated from performance 
on the 16 P-F test and from the rating 
schedules. The correlation between the 
rating of adjustment and the second-or- 
der factor “Dynamic Integration versus 
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TABLE 8 
MULTITRAIT-MULTIMETHOD CORRELATIONS FOR Two SIMPLE PERSONALITY RATINGS 
Self Self Peer 
Intro- 
Intro- Intro- : Eur 
Adjust- | version- Adjust- | version- Adjust- | version 
xtra- ment extra- men E 
mens Teran version version 
Adjust- 24 —09 Adjust- | 32 06 Adjust- | 25 12 
ment ment ment 
Peer| Intro- Parent| Intro- Parent| Intro- 
version- version- version- 
extra- _01 39 extra- 12 41 extra- 08 33 
version version version 


Anxiety" was 45, and the correlation 
between the rating of introversion- 
extraversion and the second-order 
questionnaire analogue was .6]. Com- 
Parison between the simple ratings and 


agnitude show 
considerable to 


Xpensive the 
easures may 
n of quality 
d information 
on concurrent validity for the two 7. 


point ratings is given in Table 8, 

All of the correlations for theoreti- 
cally related. dimensions are statisti- 
cally significant. None of the others 
are. The mean correlation for ratings 
Which ought to match is .33. The 
mean for hypothetically unrelated rat- 
ings is 08. This is the widest dis- 
crepancy between such values which 


has appeared so far. 
teria of convergent and discriminant 
validity, the two ratings therefore ap- 
Pear to be equal or superior to the 
considerably more lengthy and cum- 
bersome questionnaire, and the more 
complex and cumbersome rating sched- 
ules examined in this review. When 
considerations of cost are introduced 
(Cronbach & Gleser, 1957), the two 


ratings are vastly superior to the other 
devices, 


According to cri- 


Sta- 
bility correlations, over а 5-week span, 
with an examination and а vacation 
between the two ratings, were .61 for 
adjustment and 73 for extraversion- 
introversion. That is high enough for 
use in many kinds of research, and 
Perhaps that is al] any existing “per- 
Sonality tests” are good for, 

Reducing ve 
Complex and 


constructs, measured as efficiently aS 


Possible, anq elaborate as necessary: 
than to Start wi 
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Discussion 


For anyone interested in developing 
a trait theory of personality, correla- 
tions of .30 are no cause for rejoic- 
ing. Self-reports and the reports of 
others show very limited correspond- 
ence. The reports of different sets of 
judges have little to do with each 
other. The unities therefore appear to 
reflect something other than covaria- 
боп among observed behavior tend- 
encies, 
Among the many possible interpre- 
fative alternatives, one of the more 
Plausible is provided by the work 
of Osgood, Suci, and Tannenbaum 
(1957, 1962) on the measurement of 
meaning. When meaning structures 
are assessed through use of semantic 
differential ratings, three factors con- 
Sistently recur: (a) an evaluative di- 
mension which refers to the “goodness- 
adness” of objects, (b) an activity 
dimension, ie,  "activity-passivity." 
and (c) a potency dimension, i.e., 
Strength-weakness." It does not seem 
to matter who assigns the meanings. 
he factors have emerged in a num- 
ber of different societies, with raters 
Who vary in many ways, including de- 
gree of sophistication about the objects 
Perceived. Particular structures may 
vary somewhat with changes in the 
p of concepts, but the generality of 
* meaning dimensions over people 
Seems very well documented. 
. The invariant "personality" dimen- 
Stons discussed above are rather easily 
construed as topical variants of more 
i dat ways of attributing meaning to 
e Jects, in this case human objects. 
$ eiustinent” is good, “neuroticism” 
АД Ж Extraversion “means” strong 
fld on When judges evaluate pa- 
e attitudes (Schaefer, 1961), the 
ai ur dmi consistent factors are 
and ton (good) and control (strong 
active), When variance along 


the first evaluative dimension is at- 
tenuated by including only "bad" 
items, such as problems, the resulting 
factors may reflect only the attribu- 
tion, to self or others, of “dynamism,” 
ie. activity and strength, or passivity 
and weakness. Thus intercorrelations 
among clinical problems reduce to “con- 
duct problems” and “personality prob- 
lems” dimensions (Peterson, 1960). 
And factors among questionnaire items 
which discriminate between delinquent 
and nondelinquent boys reflect activ- 
ity-strength and passivity-weakness in 
the topical form Quay and Peterson 
(Peterson, Quay, and Cameron, 1959; 
Peterson, Quay, and Tiffany, 1961) 
have called “psychopathy” and “neu- 
roticism.” 

None of the evidence in this review 
requires the abandonment of trait the- 
ory, though the author believes it does 
suggest a considerable modification of 
the usual forms of trait theory. When 
one deals with verbal judgments, the 
behavior of subjects, the perceptions of 
observers, and the properties of situa- 
tions can all contribute to patterns of 
variation and covariation among the 
ratings, and all must be included 
in any comprehensive, operationally 
defined descriptive system. Relative 
strengths of influence can only be dis- 
tinguished by systematic variation of 
subject behavior, observer tendencies, 
and situational characteristics, with at 
least two independent procedures for 
examining each. Designs of adequate 
complexity are seldom employed, and 
until such designs are used more sys- 
tematically most of the important ques- 
tions about situation, role, personality, 
and behavior will remain unanswered. 


IMPLICATIONS 


If broad group factors are gener- 
ally more efficient and stable than the 
narrow "primary" factors with which 
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most investigators deal, attention should 
probably be concentrated on the for- 
mer. At least higher order factors 
should not be omitted from research, 
as is so often done when conventional 
criteria of factor inclusion are em- 
ployed. Programs of descriptive re- 
search should begin with the smallest 
possible number of constructs. The 
system should be expanded and refined 
only as new information may require. 

If one wishes to deal with more spe- 
cific aspects of personality, conven- 
tional factor analysis seems to offer 
little advantage as a data-reducing ор- 
eration. То date, most narrow pri- 
mary factors in the personality realm 
have not convincingly passed the tests 
of efficiency and invariance. If this is 
so, and if factors can be fractionated 
infinitesimally, psychology’s descrip- 
tive dilemma will be unresolved by 
continuing to generate more and more 
dimensions with each new analysis. In 
any single investigation, study of more 
numerous and limited behavior Seg- 
ments can usually proceed as well by 
examination of the elemental variables 
themselves, 

If the major factors which emerge 
from analysis of verbal data are gen- 
erally more relevant to idiosyncratic 
meaning structures than to behavioral 
traits, the scope of any inference drawn 
from verbal responses to verbal stimuli 
should be sharply limited at the start, 
and extended only as new relation- 
ships permit. At the strongest, one 
should speak of attitudes toward the 
self or other objects. Such an infer- 
ence is not devoid of significance, but 
it has some bounds, 

Ifa "personality" factor fails, as 
most seem to have failed, to exhibit 
satisfactory generality over media and 
across situations, inference to "trait" 
or any analogous personality construct 
is likewise unjustified. One is forced 


to use qualifiers, e.g., “adjustment as 
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seen by teachers in a school setting.” 
This is cumbersome, but it may be 
necessary. Contemporary assessment 
practices emphasize extensive testing 
of the individual in a single setting. 
If people are as “different” as they 
appear to be in different situations, 
and when viewed from different per- 
spectives, it is clearly essential to seek 
more information than most psycholo- 
gists presently do about these situa- 
tions and from those perspectives. _ 

If it is possible to get the critical in- 
formation in most ratings and ques- 
tionnaires with drastically abbrevi- 
ated instruments, it may be possible 
to obtain more useful information in 
other Ways at no great increase in as- 
sessment time. 
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THE EBB OF RETENTION? 


HARRY P. BAHRICK 


Ohio Wesleyan University 


Retention curves based u 


pon conventional techniques of representation 


are shown to be of limited usefulness because they confound the meas- 


CRITIQUE OF Present METHODS OF 
REPRESENTING RETENTION 


It is convenient to differentiate two 
types of indicants of retention for the 
purpose of the present discussion, The 
first type is based on a dichotomy of 
performance; the second refers to sav- 
ings scores based upon relearning of 
the material The term dichotomous 
means that in ordinary use the indicant 
yields only two Scoring categories for 
the observed response, i.e., the subject 
(5) either recalls correctly, or he fails 
to recall; he either Tecognizes correctly, 
or he does not. It is true that addi- 
tional scoring categories are Possible, 
eg., classification of types of errors 
made, or of reaction times, but these 
have not formed the basis for most 
retention curves shown in the litera- 
ture. Usually authors Teport the per- 
centage of correct responses on the or- 
dinate, and time on the abscissa. The 
interpretation of these curves presents 
problems which are quite distinct from 
those associated with the interpretation 
of curves based upon savings scores 


1 This research was supported in whole by 
the United States Public Health Service, 
Grant MH-05685-02 to Ohio Wesleyan Uni- 
versity. The writer wishes to thank David 
Bakan for valuable criticisms, 
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(the only nondichotomous indicants), 
and for this reason the two types of in- 
dicants will be discussed separately. 


INTERPRETATION or RETENTION 
Curves BASED upon Dicuoromous 
Inpicants 


In typical verbal learning and reten- 
tion studies Ss are repeatedly presented 
a sequence of verbal items, or a number 
of paired associates, Tests are later 
made to see how many of the items are 
recalled or recognized correctly, and if 
these tests are made at several points 
in time a retention curve can be drawn 
to trace the change of performance. А 
negatively accelerated, or classical 
curve, first reported by Ebbinghaus in 
1885, is frequently observed when re- 


Call measures or Savings scores аге 
used. In a previous article (Bahrick. 
1964) it was Pointed out that past in- 


terpretations of the slope of such curves 
are equivocal because the slope is 1- 
fluenced by artifacts of the distribu- 
tion of associative strength. These 


arguments will now be reviewed an 
elaborated, 


The assumption of a continuous dis- 


tribution of associative strength is bas! 
to the argument, This simply means 
that when Several Ss learn a list 0 
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paired associates or a sequence of 
items, the associations they have formed 
at the end of training will probably 
not all be of exactly the same strength. 
Rather, if associative strength can vary 
along a continuum, some associations 
A с stronger than others, and this 
form са for the several associations 
Ss > d y each S, as well as among 
reda one can thus conceive of an 
es distribution of associative 
tius ч п. Тһе measurement of reten- 
ЕЛ | a particular dichotomous indi- 
Ginn tell us no more about this 
eere ion than the percentage of as- 
ihe з which are above and below 
fo d omentary threshold value related 
nat indicant. Thus if 10 Ss have 
ated a list of 10 paired associates, 
iie бы, ht find that of the total of 100 
оо required оп a recall test 40 
t ay and 60 are wrong. The 
be ore assumption demands that 
тебене the 40 correct responses will 
the inel A strength barely above 
s eshold of recall, while others will 
ar above threshold, and some of the 
Dea, responses will be barely be- 
far HS strength, while others are 
рош ow threshold strength. The 
Beh oi recall score fails to reveal 
din nM information about the 
$ умне, ion. If a later recall test yields 
ree x retention score, ie. fewer cor- 
the it ig the comparison tells us 
Maium er or percentage of asso- 
tires, an have passed below the 
in ED d value during the time interval 
E stion. It will now be shown that 
snp нА drop in a retention 
the di epends upon (a) the variance of 
istribution of associative strength, 

x = relation of the threshold of 
the ieee the mean strength of 
istriby, ution, (c) the shape of the 
Which = ae and (d) the actual rate at 
hëse пе associations weaken in time. 

Se several factors are inextricably 


confounded in the retention curve be- 
cause no independent measures are 
available for them, and thus the cus- 
tomary interpretations of the slope of 
the curve as a reflection of the rate of 
decay of memory traces becomes equiv- 
ocal and is frequently misleading. 


Slope as a Function of Variance of 
the Distribution 


The effect of variance of the distribu- 
tion upon the slope of the curve can 
best be illustrated by conceiving of an 
extreme case. Let us assume that all 
associations formed by all Ss are 
exactly of the same strength, i.e., the 
variance of the distribution is zero; and 
let us further assume that all associa- 
tions weaken at the same rate in time, 
so that the variance remains zero dur- 
ing the course of forgetting. It would 
follow that any retention curve based 
upon dichotomous indicants would 
show 10046 correct performance until 
the associations had been weakened 
sufficiently to reach the threshold value. 
At this point they would all simul- 
taneously be “forgotten” and the curve 
would show a step-function drop from 
100% to 0% of retention, regardless of 
the rate at which the associations had 
weakened. The rate of weakening and 
the original strength in relation to the 
particular threshold used would deter- 
mine only the time at which the drop 
from 100% to 0% would occur, but 
not the slope itself. If the variance is 
larger than zero, i.e., if some associa- 
tions are stronger than others, the 
weaker ones would cross the threshold 
before the stronger ones, yielding 
something other than a step-function 
drop. Other things equal, the larger 
the variance of associative strength, the 
ater the time intervals separating 


gre i 1 
the threshold crossing of successive as- 
sociations, and the more gentle the 


slope of the retention curve between 
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—.— THRESHOLD 


OF RECALL 


— THRESHOLD 
OF RECOGNITION 


Fic. 1. A normal distribution moving 
Past two thresholds, 


two points in time. This generalization 
would hold regardless of the mean rate 
at which associations lost their strength, 


Slope as a Function of Mean Associative 
Strength in Relation to the Indicant 


It is apparent that the thresholds re- 
lated to several dichotomous indicants 
need not be the same, Thus, it is gen- 
erally thought that the threshold for re- 
call is higher than for recognition, i.e., 
that the association $ has formed may 
be sufficiently strong for a correct 
recognition response, but not for a 
correct recall response, (It was 
demonstrated in another paper by 
Bahrick and Bahrick [1965] that this 
is not an intrinsic difference, but de- 
pends upon the choices offered on the 
recognition test.) One means of con- 
ceptualizing this in relation to the dis- 
tribution of associative Strength is 
Shown in Figure 1. At Time t the 
distribution is partly below the recall 
threshold, but completely above the 
recognition threshold, Thus, perform- 
ance on recall might be 80% correct, 
and on recognition 100%. At Time t' 
the distribution has lost strength, and 
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has fallen for the most part below the 
recall threshold, but has only begun to 
fall below the recognition threshold. 
The new scores might be 2096 correct 
for recall and 9095 for recognition. 
Thus, during the interval from t to t 
the slope of the curve based upon the 
recall indicant is much steeper than 
the slope of the curve based upon the 
Tecognition indicant, Recall perform- 
ance has dropped by 60%, recognition 
performance by only 10%, despite the 
fact that the rate of weakening of the 
associations has been assumed constant. 
Thus, the Sensitivity of a particular 
indicant depends upon the momentary 
threshold of the indicant in relation to 
the mean of the distribution of associa- 
tive strength at the outset of the time 
interval in question. Failure to con- 
sider such artifacts has led to over- 
generalizations and misinterpretations 
Tegarding the rate of forgetting for 
“recognition versus recall memory 
(Bahrick, 1964), as well as misinter- 
Pretations regarding the slope of learn- 
ing curves (Bahrick, Fitts, & Briggs, 
1957), 


Slope as a Function of the Shape of 
the Distribution of Associative Strength 


Figure 2 shows Several possible dis- 
tributions of associative strength grad- 
ually passing through a threshold, and 
below each type of distribution the re- 
lated retention curves, Assuming only 
for the Purpose of illustration that as- 
sociations weaken at a constant rate 10 
time, the Considerable influence of the 
shape of the distribution upon the slope 
of the retention Curve is apparent an 
does not require elaboration. Classical: 
i.e., negatively accelerated curves of re- 
tention can be obtained as the result 0 
a distribution like the one shown in 

i » although every one of the 


associations lost strength at a constant 
rate in time, 
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% RETAINED 
a 
o 


9 o 


Gan Len illustrations have shown 
curve a percentage drop ofa retention 
is aiot one a particular time interval 
which Ro y a function of the rate at 
ing thie sociations have weakened dur- 
aice mi Me but also of the vari- 
associative. ape of the distribution of 
of the me strength, and of the relation 
threshold p of that distribution to the 
рь 4 of the indicant being used. 
hus xs control or assess these vari- 
Meaningful. independently precludes 
Amount, к. comparisons of relative 
intervals of retention for different time 
ing the < or for different indicants dur- 
ame time intervals. 


I 
pue or RETENTION CURVES 
SED UPON SAVINGS SCORES 


ТІ 
пе only commonly used indicant of 


Tetentj 
i Bru е 
on which is nondichotomous, i.e., 


THRESHOLD 


о 


ТІМЕ 


Fig, 2 
- The effect of the shape of the distribution of associative strength upo! 


m retention curves. 


usually yields more than two scoring 
categories, is based upon relearning 
scores. Ultimately, of course, savings 
scores are also based upon dichotomous 
measurement, i.e., the responses of Ss 
are still only scored as above or below 
the threshold of recall or recognition 
on each trial. However, the derived 
scores based upon the comparison of 
learning and relearning trials yield a 
continuum of "savings." Present criti- 
cism relates to problems which arise 
from (a) unreliability, and (5) the use 
of trials as units of measurement. 


Unreliability of Savings Scores 

this method were 
(1922) who con- 
that the method 


factory results than 
[p. 35].” This 


The limitations of 
recognized by Luh 
cluded empirically 
“produces less satis 
does any other method 
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is true because the reliability of the 
comparison of learning and relearning 
trials is limited not only by the 
momentary fluctuations of dichotomous 
thresholds, as are all the other methods, 
but also by the cumulative unreliability 
contributed by many other variables 
which affect the rate of acquisition in 
the two learning processes to be com- 
pared. . 

Further, the comparison of learning 
and relearning rates has traditionally 
been limited to the point at which 
100% of the material has been mas- 
tered, i.e., the point at which the entire 
distribution of associations has passed 
above a given threshold during learn- 
ing. A single point of this Sort does 
not represent a very reliable base for 
comparison, and the 10076 point is less 
satisfactory than other points on the 
distribution, €g., the 50% point. A 
more reliable procedure would base the 
Savings score upon an average compari- 
son of several points, e.g., the savings 
effected in mastering 25%, 50%, and 
75% of the material. Similar prin- 
ciples were Tecognized long ago in con- 
nection with Psychophysical problems, 
but they have not been applied to the 
methodology of retention studies where 
they are equally relevant. 


Use of Trials as Units of Measurement 


If retention is determined by a com- 


Й 


ave become the 


units of retention, The use of such 


units can form the basis for inferences 
about changes in associative strength 
only if we can assume that the gain in 
associative strength, or the amount 
learned per trial, is reasonably con- 
stant. If we assume, however, that 
the learning process js negatively ac- 
celerated, i.e., that Progressively less 
associative strength is developed on 


successive trials, then saving the last 
trials has a very different meaning from 
saving the earlier trials. A vut 
accelerated curve of forgetting coul 
result, even though associations weak- 
ened at a constant rate in time. This 
could occur because many of the last 
trials during which little associative 
strength was gained are saved at first, 
and later progressively fewer of the 
earlier trials are saved, during which 
progressively more was learned. 

The loss of associative strength т 
flected by a single trial (i.e., the size r 
the unit) also varies as a function О 
individual differences in rate of [ап 
ing. Thus, if an association has G 
fallen below threshold and is relearne 
іп a single trial, this small amount 0 
forgetting would be represented as а 
saving of 8096 for an individual = 
originally learned in five trials, s 
the same loss of associative strengt, 
would be reflected by a saving of 90% 
for another S who originally learne 
in 10 trials. Tt is apparent that the 
slope of retention curves based upon 
Savings scores cannot be interpreted : 
terms of losses of associative strengt 
in other than an ordinal sense. 


Use or VARIANCE Units ок EBBS 


If retention curves are to yield infor 
mation about relative rates of fof 
getting, ie, the rate of weakening 
associations during different u 
periods, it is necessary to use equal 2 
tervals on the ordinate, Further, is 
must either experimentally control v 
variables causing distributional p 
facts, or develop techniques which stis 
less sensitive to them. Опе poss! e$ 
technique is proposed here. It ively 
plotting estimates of the change of is 
mean of the distribution of associati 
Strength in units of variance of Thé 
distribution on the ordinate. on 
original observations can be based ЧР 
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Fr ^ icipati 
с. 3. Percentage correct anticipation an 


Teca € 
of азн or any combination 
holds, but аратар to different thres- 
as Telere the thresholds are used only 
mean of ig points for locating the 
To ill he distribution. 

or ma the technique, the curve 
Study (1922) v performance in Luh’s 
convention 2) which is plotted by the 
е compared method in Figure 3 can 
Plotted in eh with the same curve re- 
Obseryeq ebb units in Figure 4. Each 
18 Eie е anticipation value 
ing а tabl into a Z value by consult- 
Curve, Th of integrals of the normal 

€ Z value gives the distance 


20 
TIME IN HOURS 


o——— RECOGNITION 
о—— —o ANTICIPATION 


30 40 


4 recognition from Luh (1922). 


from the threshold point to the mean of 
the distribution in sigma units. If 
more than 50% of the responses are 
initially correct, as is the case here, 
the mean of the distribution is above 
the threshold and the first Z value is 
As forgetting progresses to 


positive. 
the point where only 50% is recalled 
the mean of the distribution has 


reached the threshold, and the Z value 


has become zero. When the per- 
centage correctly recalled is less than 
50 the mean of the distribution has 
passed below the threshold, and the Z 
value becomes negative. To designate 
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Fic. 4. Retention curves based on ebbs from data by Luh (1922). 


the beginning of the forgetting process 
an ordinate value of Zero may arbi- 
trarily be assigned to the first Z score 
after learning has terminated. Subse- 
quent values on the ordinate are ob- 
tained by subtracting each converted Z 
score from the Original one, and thus 
the course of forgetting is plotted in 
sigma units or “ebbs,” reflecting the 


TABLE 1 


PERCENTAGE ANTICIPATION FROM Lun 
CoNvERTED to Еввѕ 


Percentage ar 

anticipation transformation Ebbs 
67.8 46 0 
50.2 :06 40 
39.0 —.28 44 
17.8 —.92 1.38 
10.0 —1.29 1.75 


of 
downward movement of the mean ih 
the distribution of associative SIRER 
The percentage anticipation, Z conve! 


1n 
Sion, and ebb values are shown 
Table 1. 


ADVANTAGES OF THE METHOD 


Conventional retention curves ba : 
upon dichotomous Scoring indicate oe 
Percentage of associations which n 
below a threshold value during а give 
time interval, It has been shown V T 
the slope of such Curves cannot pur. 
a direct basis for comparison of pt 
of associative Strength during differe n 
time intervals. The transforma 
Proposed here does permit such а a 
parison, Provided certain assumpti? n 
can be met, The curve based i od 
ebbs shows how much the mean as 
ciation has weakened, The use 
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Sigma units on the ordinate prevents 
the slope of the curve from being a 
function of the variance of the distribu- 
tion, as long as the variance remains 
constant. This is true for the same 
reason that conversion of raw scores 
to Z Scores achieves comparability de- 
spite differences in mean and sigma of 
the raw score distribution. While we 
are no better off in comparing “the 
amount" of forgetting for two different 
Studies, unless we can compare the 
variance of the distributions involved, 
It is possible to compare the curves 
yielded by different indicants over vari- 
Ous time intervals within the same 
Study, provided the variance has sta- 
bility. Ву conventional methods it has 
not been possible to determine whether 
recognition, recall, and other indicants 
Yield basically different time functions 
lor forgetting, because the obtained 
curves were differentially affected by 
artifacts of changing sensitivity of the 
indicants, illustrated in Figure 1. The 
Proposed technique permits a reexami- 
Nation of such questions under more 
favorable conditions. 

The shape or form of the distribution 
Continues to have an effect upon the 
Slope of the ebb curve. The rectilinear 
and normal distributions shown in Fig- 
ures 2A and 2C will now produce iden- 
tical curves, and this is equally true for 
T other type of symmetrical distribu- 
ü n, provided an unbiased estimate of 

he position of the mean is obtained. 
us unbiased estimate can be ob- 
р €d, as is likely for distributions de- 
ria] markedly from normality, the 
rel functions will be affected, as 
desis е shown later on in more detail. 
{рар produced by using different 
TE such as recall and recognition, 
i не. ed in Figure 1, will disappear, 
tributi are dealing with a normal dis- 
the di on, but continue to be present if 

istribution departs markedly from 


normality. If the distribution is nor- 
mal and maintains constant variance, 
and if recall and recognition refer to 
different thresholds dichotomizing the 
same distribution at different levels, 
then it should make no difference at 
all which of the thresholds we use as 
our reference point in tracing the 
course of the mean of the distribution. 
Departures from normality will result 
in different types of bias in locating the 
mean with respect to different thresh- 
olds, and thus, the method provides a 
potential source of information about 
the shape of the distribution. An illus- 
tration of this use of the method will be 


given later. 


AssUMPTIONS UNDERLYING THE USE 
or THE PROPOSED TECHNIQUE 


The assumptions, already implied 
above, are of two types. The first, and 
most important, refers to the variance 
of the distribution; the second, to the 
thresholds of different indicants in re- 
lation to the distribution. Plotting 
changes of the mean in sigma units is 
meaningful only if the sigma of the dis- 
tribution remains relatively constant 
throughout the course of forgetting. 
It would not matter if individual asso- 
ciations lost strength more quickly than 
others, as long as compensatory varia- 
tions of this type left the variance of 
the entire distribution stable. 

Direct tests of the validity of this 
assumption are not possible, since no 
direct measures of variance are ap- 
plicable. The strength of individual 
associations can be measured only in 
relation to dichotomous thresholds. 
Even if one accepts the number of 
correct responses during training as 1n- 
dicative of associative strength, this 


indicant would yield only a single 
jance, and no informa- 


measure of var un t 
tion regarding the stability of the vari- 
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ance during the retention period could 
be deduced from it. 

The second group of assumptions 
relates to the question as to whether 
different dichotomous indicants such as 
free recall, anticipation, and recognition 
divide the same distribution at differ- 
ent levels. Alternatively, one might 
assume that these indicants, though of 
different difficulty level, do not assess 
the same associations, but rather relate 
to different types of concurrent learn- 
ing, and thus refer to separate distribu- 
tions. The most relevant evidence in 
favor of a unitary assumption is statisti- 
cal. It has been shown repeatedly 
( Postman, Jenkins, & Postman, 1948) 
that high correlations exist between 
items correctly recognized and those rc- 
called, i.e., that practically all of the 
items correctly recalled are also cor- 
rectly recognized. (But not vice versa.) 
If thelearning processes were independ- 
ent such high correlations would not 
occur. Accordingly, Postman, Jenkins, 
and Postman concluded: The basic dif. 
ference between the two tests appears 
to lie in the minimal strength of asso- 
ciation which they require for success- 
ful performance, 

An important qualification must be 
stated, however. Performance on re- 
call, as well as on recognition tests, 
may or may not depend upon memory 
of sequences, Ina free-recall test, for 
example, S may be asked to write down 
as many of the items as he can recall, 
and the responses can be scored with- 
out regard to the sequence of presen- 
tation. Likewise, on a recognition test 
S may be asked to identify the items 
among other incorrect ones, without re- 
gard to sequence of presentation, The 
assumption of a single distribution 
dichotomized at two levels 


appears 
reasonable in this situation. It is also 
possible, however, to score sequential 


associations in either the recall or the 
recognition test. The method of anti- 
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cipation demands such association on 
the recall level. To test these on the 
recognition level a variation of the 
procedure of reconstruction can be em- 
ployed, in which the correct items are 
presented in various sequences, and S 
must identify the correct sequence. It 
is reasonable that learning in a paired- 
associate task involves a two-phase 
process, where sequential associations 
are formed only after individual items 
have been differentiated. If one, but 
not both the recall and recognition 
tasks requires memory of sequences, 
the assumption of a single distribution 
dichotomized at two levels would seem 
less justified, 


INDIRECT Evivence RELATED 
TO THE AssuMPTIONS 


Investigations in which several types 
of indicants are used to test retention 
at several points in time are a poten- 
tial source of evidence in regard to 
both types of assumptions previously 
discussed. If the variance of the dis- 
tribution remains constant during the 
course of forgetting, and if this same 
distribution is dichotomized by two 
thresholds of different difficulty level, 
performance differences on the two 
tests, when converted into Z Scores, 
should remain relatively stable over the 
total retention period, Referring to 
Figure l, we would expect equal Z- 
Score intervals to Separate performance 
on recall and Tecognition tests at Time 
t, and any later time at which both 
thresholds dichotomize the distribution. 


confound the later 
(5) use different Ss at 


vide a Possibility Teexamining the 
same distribution, In the latter case 
they provide only another sample of a 


Ess or RETENTION 69 
TABLE 2 
Z SCORE DIFFERENTIALS IN SEVERAL STUDIES COMPARING RECALL AND 
" RECOGNITION PERFORMANCE 
Luh 
20 1 4 1 2 
minutes hour hours day days 
Гене correct recognition 97.8 94.6 93.3 74.6 71.5 
ee correct anticipation 67.8 50.2 39.0 17.8 10.0 
Se in percentage 300 444 843 568 61.5 
ja eel Z value 2.01 1.61 1.50 .66 57 
т.1001ра оп 2 value 46 0.06 —.28 —.92 —1.28 
ifference in Z values 1.55 1.55 1.78 1.58 1.85 
Miler 
5 1 6 21 4 14 
seconds hour hours hours days days 
percentage correct recognition 65.8 65.8 54.1 55.0 50.4 38.3 
yrcentage correct recall 41.6 35.0 26.6 27.5 21.6 6.6 
= ifference in percentage 24.2 30.8 27.5 27.5 28.8 91.7 
pecognition Z value Al Al .10 43 01 —.30 
Recall Z value —21 -—.39  —.63 —.60 —.79 —1.51 
Difference in Z values .62 .80 43 13 .80 1.20 
Bahrick and Bahrick 
Immedi 2 2 2 
ately hours days weeks 
Percentage correct—easy recognition 80.0 857 771 38.6 
pos entage correct—anticipation 78.6 62.9 65.7 35.7 
Difference in percentage 14 226 114 2.9 
Easy recognition Z value а 107 4 —29 
Anticipation Z value 59. 33 40 —37 
ifference in Z values 05 8 3 06 
Bahrick and Bahrick 
E 2 2 
oum) ; hours days weeks 
Percentage correct— difficult 
recognition 543 286 414 14.3 
ercentage correct—anticipation 78.6 629 65.7 35.7 
ifference in percentage 24.3 363 24.3 214 
Arne recognition Z value Ab =S —22 hf 
nticipation Z value 19 33 40 —37 
68 90 € 70 


ifference in Z values 


Similar distribution. Another limita- 


Table 2 summarizes data from three 
t investigations in which two 


tion arises from the type of tests used. different inv 4 
n hese may require sequential learning ог more indicants of retention were re- 
Or one, but not both of the tests, and corded at different time periods. The 

н the Z-score differentials із 


of they may relate to different stages 
n a two-stage learning process, as dis- 
Ussed above. 


stability of 
most impres 
anticipation per 


sive for the recognition and 
formance in Luh’s study 
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where the same Ss were used re- 
peatedly. Despite the fact that differ- 
ences between recognition and antici- 
pation performance increase from 30 to 
60%, the Z-score differentials remain 
remarkably constant during the entire 
range of time from 15 seconds to 2 
days. The data of Miler show stabil- 
ity of Z-score differentials for all except 
the 2-week interval, and the data by 
Bahrick and Bahrick (1965) show 
some stability for the differentials be- 
tween a difficult recognition task and 
an anticipation task, but not for the 
comparison of an anticipation and an 
easy recognition task. A reminiscence 
effect in performance on the ease recog- 
nition task may account in part for the 
instability, as well as the assessment of 
sequential learning in anticipation per- 
formance, but not in connection with 
recognition performance. A search of 
the literature revealed several other 
studies in which two or more indicants 
of retention were recorded at different 
time intervals, but they did not lend 
themselves to the above type of analy- 
sis. Data from a study by Myers 
(1914) are not presented because they 
are based upon only two time intervals. 
Data from studies by Postman and Rau 
(1957) and by Burtt and Dobell 
(1925) could not be analyzed. In the 
former case, recognition performance 
remained nearly 100% correct through- 
out the retention period, and thus the 
threshold of recognition did not 
dichotomize the distribution SO as to 
yield a reliable reference value (a situ- 
ation similar to the one shown in Fig- 
ure 1 at Point t). In the latter case, 
recall performance remained near 0% 
after the first measurement, In this 
connection it is important to note that 
percentage values become less reliable 
as they approach the values of either 
100% or 0% and that this unreliabil- 
ity is magnified by a Z-score transfor- 


BAnRICK 


mation which increases differentiation 
of percentage values near the extremes. 
Consequently the transformation de- 
scribed here should probably not be 
used when scores on the dichotomous 
task are extreme. This would apply 
to any study in which training con- 
tinued to the point at which 100% of 
responses are correct, or above thresh- 
old. 

It is not possible to assert on the 
basis of the evidence from these studies 
that the variance of distributions of as- 
sociative strength does not change dur- 
ing the course of retention. No statis- 
tical test of the significance of changes 
in the Z-score differentials is applicable 
because data are not available for indi- 
vidual Ss. Even if such data were 
available, and failed to reveal signifi- 
cant changes, they would not permit 
a proof of the null hypothesis. If one 
keeps in mind, however, the fact that 
different Ss are involved for successive 
time intervals and that the Z-score dif- 
ferentials show no consistent trend 
toward increasing or decreasing over 
intervals ranging from 15 seconds to 
2 weeks, but rather show comparative 
stability, the assumption of constant 
variance appears plausible. 

It would seem necessary to assume 
that variance values would eventually 
decrease as memory traces continue to 
weaken, but this may occur beyond the 
range of Sensitivity of available dicho- 
tomous indicants. Only the 2-weeks 
value obtained from the data by Miler 
Suggests such a phenomenon during 


the time Tange covered by these inves- 
tigations, 


EVIDENCE REGARDING THE SHAPE 
OF THE DISTRIBUTION 
The analysis presented so far lends 
some support to the assumption of 
stability of variance, but offers little 
evidence regarding the shape of the 
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d on ebbs from data by Miler (1957). 


are based upon integral 
values of the normal distribution, and 
thus, divergence of the two estimates 
offers some clues in regard to de- 
partures of the distributions from 


normality. 
If a distribution is less peaked than 


a normal one, ie. if it has relatively 
more area at the extremes, we would 
expect exaggerated estimates of the 
drop in the position of the mean, when 
the estimate is based on percentage 


estimates 
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Fic. 6. Retention curves based on ebbs from data by Bahrick and Bahrick (1965). 


changes recorded near the extremes of Conversely, with a distribution y 
the distribution. However, if the peaked than a normal one, the drop "t 
threshold is passing near the mean of the mean would be underestimated i 
the distribution, the predicted drop in the reference point is at the a 
the mean would be an underestimate. but overestimated if the reference poin 
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is near the center. Comparison of the 
magnitude of Z-score changes of the 
mean predicted from the two reference 
points (values in Table 2) shows the 
following: The more extreme reference 
point leads to smaller estimates of the 
drop in the position of the mean 
(smaller Z-score changes) when the 
Percentage values are above the mean 
(larger than 50%); if the values are 
below the mean (smaller than 50%) 
the more extreme score leads to larger 
estimates of the drop in the position of 
the mean (larger Z-score changes). 
This is consistent for all three studies, 
regardless of whether the curve based 
upon recognition or recall is the higher 
one, 

It would appear from these facts 
that the distribution is somewhat asym- 
metrical, i.e., that the upper part of the 
distribution is peaked and shows less 
area at the extreme than a normal 
distribution, but that the lower part of 
the distribution is skewed, i.e., shows 
more area in the extreme than a normal 
distribution. 

These interpretations are tentative, 
and will require additional support be- 
fore they can be accepted with confi- 
dence, However, it is apparent that 
the method which has been described 
Offers a variety of opportunities for ex- 
ploring aspects of the retention process 
and of the distribution of associative 
Strength, which have not been available 
to the conventional techniques of repre- 
Sentation. Further information about 
these distributions during the course of 
Acquisition, as well as during the course 


of retention, should prove to be of 
value in the analysis of phenomena of 
learning and retention. 
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or even reasonable, to restrict the term 
“defense” only to a failure to recognize 
a “threatening” stimulus, since an or- 
ganism which fails to recognize danger 
would, at first glance, seem especially 
defenseless. It has been suggested that 
the term was applied in this particularly 
restricted way because of an analogy 
to the ego defenses of psychoanalytic 
theory (e.g, Howie, 1952). Such an 
analogy is false because the ego de- 
fenses are not restricted only to those 
which cause a failure to recognize ex- 
ternal stimuli (e.g., Fenichel, 1945; 
Freud, 1937 ; Lazarus, 1954). 
Recognition thresholds which are 
either raised or lowered have been in- 
terpreted as due to defenses, and this 
view has received experimental support 
(e.g., Chodorkoff & Chodorkoff, 1958; 
Deese, Lazarus, & Keenan, 1953; 
Eriksen, 1951a, 1951b, 1954a, 1954b, 
1954c; Eriksen & Davids, 1955; Erik- 
sen & Lazarus, 1952; Lazarus & Longo, 
1953; Miller, 1954; Truax, 1957), 


Perceptual Vigilance 


Dulany (1957), among others, has 
argued that the really important form 
of perceptual defense is a “vigilant” 
tendency to recognize emotion-arousing 
stimuli unusually readily, although its 
presence is often obscured by response 
bias. 

A mechanism of perceptual vigilance 
is both plausible and of value in theory, 
It need only strengthen or clarify the 
perceptual response during the process 
of recognition. It need not involve a 
“super-discriminating preperceiver” 
who selectively prevents recognition, a 
preperceiver who has proved difficult 
to find (Eriksen, 1960). A theory of 
vigilance has been recognized as im- 
portant to the explanation of stimulus 
detection (Frankman & Adams, 1962). 

The constructs of such a theory might 
well include a mechanism of perceptual 
vigilance. 


Minarp 


HIGH 


RECOGNITION THRESHOLD 


Low 


easa], 
Low HIGH 


STIMULUS EMOTIONALITY 


Fic. 1. The relation of recognition thresh- 
old to level of stimulus emotionality (sug- 
gested by Brown, 1961; Bruner & Postman, 
1947; DeLucia & Stagner, 1953; ete.) 


Formulation Integrating Previous Find- 
ings and Conceptions 


In his recent monograph, Brown 
(1961) argues that perceptual defense 
is best defined as a descriptive term for 
any systematic relationship between 
stimulus emotionality and the ease of 
recognition of stimuli, He implies, as 
have other reviewers, that both ele- 
vated and lowered thresholds are part 
of the same functional 
(Inglis, 1961), 

Some of the earlier studies (e.g. 
Bruner & Postman, 1947; DeLucia & 
Stagner, 1953), recent reviews (e.g. 
Brown, 1961; Inglis, 1961), and an 
additional Study by Brown (1961) sup- 
Port the interpretation that threshold 
first rises, then falls with an increase in 
stimulus emotionality (Fig, 1), Such 
a complex relationship does not sup- 
Port a restricted definition of perceptual 
defense, or the "one-tailed" statistical 
tests which have so frequently been 


criticized (e.g., Eysenck, 1961; Hick, 
1952), 


relationship 
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Criteria for Studies Evaluating a Re- 
Sponse-Bias Hypothesis 

Thus far this paper has attempted to 
demonstrate that studies supporting 
a response-bias hypothesis have often 
been particularly restricted in their 
approach. They may constitute an 
attack on studies using an overly re- 
stricted conception of perceptual de- 
fense, However, they do not neces- 
sarily conflict with perceptual-defense 
Studies using the more complex con- 
ception suggested by research done 
since the early work (eg. Brown 
Covers well over 300 papers). Per- 
ceptual defense can be conveniently 
reviewed in the context of the criteria 
research has suggested for a study 
evaluating the response-bias hypothe- 
Sis. 

1. As was shown by the controversy 
following the McGinnies study (1949), 
measurement of perceptual defense re- 
quires that the experimenter avoid 
using unexpected or socially unaccept- 
able stimuli. Otherwise either response 
Suppression or a lack of familiarity with 
Certain stimuli provide a ready explana- 
tion of results (Freeman, 1954, 1955; 
Howes & Solomon, 1950, 1951; Lacey, 
Lewinger, & Adamson, 1953). 

2. It seems important to use stimuli 
Chosen to be personally emotion arous- 
Ing for the subjects for whom they are 
used (Eriksen, 1954a) and to incorpo- 
Tate adequate controls for word fre- 
quency. A desirable control for indi- 
vidual response probability would be 
allowed by response bias measures ob- 
tained during the course of a perceptual 
defense study. 

3. Results should not be averaged 
for stimuli which differ greatly in their 
“Mmotion-arousing properties if Figure 

has any validity. The low threshold 
Or some stimuli and the high thresh- 
Olds for others would be averaged out 
When combined. 


4. Averages computed for subjects 
with quite different personalities may 
also obscure results. The subject's 
personality may serve to alter the effec- 
tive level of stimulus emotionality 
(Brown, 1961; Inglis, 1961). Brown, 
as he predicted, found evidence that in- 
troverts reach their highest recognition 
threshold at a lower stimulus emotion- 
ality than extroverts. Variables were 
defined by scores on the Maudsley 
Personality Inventory (Eysenck & 
Claridge, 1962). 

Other individual differences in per- 
ceptual defense have been reported as 
well (Eriksen, 1951a, 1951b, 1952a, 
1952b; Eriksen & Davids, 1955; Erik- 
sen & Lazarus, 1952; Lazarus & Longo, 
1953; Truax, 1957; Wall & Guthrie, 
1959). Mathews and Wertheimer 
(1958) found perceptual defense sig- 
nificantly greater than response bias 
in subjects high on the MMPI Hy 
scale (Dahlstrom & Welsh, 1960) but 
not for those high on the Pt scale. 
It may be that some learning histories 
and personalities lead to perceptual and 
some to nonperceptual forms of defense. 

5. Averages and comparisons which 
involve different situations may also 
serve to obscure or confuse results. 
The experimental situation (only partly 
described by instructions to subjects) 
may also serve to alter the effective 
level of stimulus emotionality. For 
example, Brown found that with fe- 
males he was able to demonstrate the 
whole curve shown in Figure 1; with 
males, only the first (ascending) por- 
tion of the curve was demonstrated. 
The sexes differed by about three 
standard deviations in their thresholds 
to the most emotional words. He sug- 
gested that being alone with a male 
experimenter was more emotion arous- 
ing and the effective level of stimulus 
emotionality was greater for females 
than for males. Perceptual interpreta- 
tions of modifications of perceptual de- 
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fense by slight changes in instructions 
have also been given by Blum (1954), 
Clapp (1951), and Nelson (1955). 

The changes produced by different 
experimental situations and instruc- 
tions have also been taken to imply that 
perceptual defense is not perceptual 
(e.g., Postman, Bronson, & Gropper, 
1953; Nothman, 1962). Certainly 
studies of verbal behavior have shown 
the effects of the experimenter's be- 
havior on response probabilities (e.g., 
Krasner, 1958; Salzinger, 1959). The 
subtle cues he provides either intention- 
ally or accidentally may have complex 
effects (Sarason & Minard, 1963). 
The precise effect of instructions may 
depend on the nature of the task the 
subject performs (e.g, Sarason & 
Minard, 1962). 

Either a wholly perceptual or a 
wholly nonperceptual interpretation of 
changes in perceptual defense seems 
premature for studies in which per- 
ceptual measures are thought to be af- 
fected by response bias and response 
bias is not measured. 

6. Since there are many complex 
differences between subjects and situa- 
tions it seems important to obtain meas- 
ures of perceptual defense and response 


bias in the same subjects and nearly 
simultaneously. 


AN EVALUATION OF THE RESPONSE- 
Bras HYPOTHESIS 


Studies have demonstrated that, un- 
der certain circumstances, response bias 
may be great when compared with per- 
ceptual-defense results (e.g., Goldstein, 
1962). However, those meeting cri- 
teria suggested above demonstrate that 
response bias does not satisfactorily ac- 
count for perceptual defense; it is nec- 
essary to develop explanations which 
allow the stimulus a greater role and 
which do not depend on mechanisms 
which have their main effects on purely 
verbal responses. 


MINARD 


Mathews and Wertheimer (1958), 
using visually presented words, found 
perceptual defense over and above any 
evidence for simultaneously measured 
response suppression. Minard (1963) 
replicated and extended their research 
(which used a stimulus selection tech- 
nique similar to that used by Brown in 
selecting highly emotion-arousing stint- 
uli). This study will be described in 
some detail because it illustrates major 
points of this review and allows more 
effective evaluation of previous work. 


Procedure 


Subjects. A sample similar to that 
of Mathews and Wertheimer (1958) 
was selected. None who were selected 
refused to participate as paid volunteers 
uninformed regarding the basis of s€- 
lection or the nature of the study. The 
26 males and 26 females were college 
students with scores in the upper quar- 
tile of a group of 312 MMPI HY 
scores, but below the median on the Pt 
scale. All were introductory psychol- 
ogy students naive regarding perceptual 
defense. 

Stimulus Selection. Stimuli were 
eight words selected by objective cri- 
teria applied to a word association test 
consisting of 46 potentially emotion- 
arousing words and of neutral words 
paired with them on the basis of struc- 
ture and frequency in English. Words 
for the test were selected by the agree 
ment of three psychologists familia 
with the college population, All word? 
were common and acceptable in stt; 
dent conversation (eg. “adjust 
"child" “dance”). Four emotion- 
arousing words were chosen for each 
Subject, and each was paired with ? 
neutral word. Chosen stimuli had 25" 
sociative reaction times within the 
longest 8% of the subject’s times, ve 
time long also when compared with the 
mean time for that word as found fo 
а norm group of the selected, like-5e*** 
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subjects. Other evidences of emotional 
arousal were observed or tape recorded 
(blushing, sweating, stuttering, long 
explanations of associations, etc.). 
This evidence was ignored in the selec- 
tion of emotional words, since it was 
not found to change the stimuli selected. 
| Stimulus Presentation. The rela- 
lively brief word association test was 
followed by a selection of the condi- 
lions for tachistoscopic presentation of 
stimuli, i.e., the settings at which the 
Subject's discrimination of structurally 
similar stimuli was 60-70% accurate 
and better than chance according to a 
binomial test. In this determination 
the subject held a card listing two neu- 
tral stimuli not selected for later presen- 
tation (eg. “line,” land"). He was 
instructed to name the word presented 
On each trial and to describe "exactly 
What he saw" on the screen. Immedi- 
ately before the perceptual defense ex- 
periment the determination was re- 
peated. Stimuli were presented with 
the first letter at center field by a 
Stroboscopically calibrated tachisto- 
Scope, accurate within 596. They sub- 
tended an average visual angle of 9.5 
degrees, 
. After this determination each sub- 
Ject was given a card listing the eight 
Stimuli selected for him, told "We'll 
do the same thing with different stim- 
uli" and given the instructions used by 
Mathews and Wertheimer (1958) "for 
every flash, tell me what word you 
think was presented." In addition, the 
Subject continued describing "exactly 
What he saw” when he wished. Stimuli 
Were then repeatedly presented at 10- 
Second intervals except for three regu- 
larly spaced 30-second breaks. Within 
Sach of the four blocks of 10 trials, the 
Stimuli used were randomly arranged. 
eae to the subject, half of the 
stimuli were never presented. Only 
"id €motional words and their neutral 
irs had been selected for presentation 


according to a procedure which equated 
presented and unpresented words for 
emotionality on the basis of all avail- 
able information. In addition, smudged 
blank slides had been added, at ran- 
domly determined positions, to each of 
the four blocks of stimuli. At the end 
of the experiment the subject was given 
a deck of index cards, on each of which 
was typed one of the neutral and one 
of the emotional words selected for him. 
The deck was randomly ordered and 
also included four pairs consisting of 
neutral words and four pairs consisting 
of emotional words. All words in all 
possible pairings were included in the 
deck. He was asked to "simply name 
one word from each card." From the 
initial meeting of experimenter and 
subject to a detailed, probing interview 
at the end of the experiment, all verbal 
behavior was tape recorded. Its evalu- 
ation by the experimenter was subse- 
quently checked with the aid of a male 
and a female clinician. 

As a result of the procedures which 
have been described, the selected emo- 
tion-arousing and neutral words were 
presented at a level at which the sub- 
jects frequently reported a "word-like" 
stimulus, sometimes confidently recog- 
nized stimuli, never discriminated 
blank slides from other difficult-to-dis- 
criminate stimuli, and obtained a fre- 
quency of accurate response signifi- 
cantly better than chance (usually 
about 30%). The determination of 
this level for presentation equated the 
subjects for general accuracy, which 
was not a variable of experimental in- 
terest. The practice it provided led to 
skilled and relatively stable perform- 
ance (eg. an absence of inappropri- 


ately timed blinks). 
Probability Formulation of Results 


The basis for the perceptual-defense 
measure was the difference between the 
probability of a neutral response which 
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was accurate and the probability of an 
emotional response which was accu- 
rate. As Goldiamond (1958) and 
others have demonstrated, this dif- 
ference is conceptually equivalent to 
differences between ascending-method- 
of-limits thresholds since it is the 
"sufficiently high" probability of an 
accurate response which determines 
the point at which the ascent ends and 
the threshold is recorded. However, 
use of this difference, rather than a 
“threshold,” aids in the comparison of 
perceptual defense with response bias. 

The basis of the response-bias meas- 
ures was the difference between the 
probability of a neutral response which 
was a guess and that of an emotional 
response which was a guess, deter- 
mined within some group of responses 
which corresponded so little to a dis- 
criminative stimulus that the term 
guess seemed an accurate description, 
е.5., any response to a blank slide or 
the response “suck” to the stimulus 
“dance.” When making such re- 
Sponses, the subjects sometimes com- 
mented “that was a pure guess,” 

The guesses used for estimating re- 
Sponse bias included (a) responses to 
blanks, i.e., guesses in the absence of 
discriminative stimuli, (b) all inaccu- 
rate responses, i.e., guesses made in the 
presence of various unrecognized stim- 
uli, (c) guesses naming stimuli never 
presented, i.e., particularly wild guesses, 
(d) guesses made when neutral words 
were presented, (е) guesses made when 
emotion-arousing words were pre- 
sented, (f) choices made from clearly 
presented words. Three additional re- 
sponse bias measures were obtained by 


correcting b, d, and e for chance ac- 
curacy.? 


2 Гог each subject group, separate cor- 
rections were made for each stimulus-re- 
sponse combination (neutral responses to 
neutral stimuli, neutral responses to emo- 
tional stimuli, etc.), However the logic of 
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Thus the experiment yielded a group 
of response-bias measures obtained un- 
der a variety of stimulus conditions and 
based both on response groups about as 
large as the group consisting of ac- 
curate responses and оп response 
groups nearly four times this large. 

The basis of each measure was the 
difference between the probability of a 
neutral response and that of an emo- 
tional response within a given response 
subset, ie., a difference between con- 
ditional probabilities. Results are con- 
veniently written and communicated 
in terms of the differences between per- 
centages. 

Convergent validity for response-bias 
measures is indicated by correlations 
obtained in 52 male and female subjects 


the procedure is the same in each case and i5 
sufficiently illustrated by describing the cor- 
rection of neutral responses to neutral stim- 
uli. The formulas correct for the probability 
of chance accuracy, P (chance accuracy): 
which is adjusted for the subject's tendency 
to choose words from the presented group 
and for the subject's response bias. Cor- 
rected response frequencies are used in place 
of the observed frequencies in computing 
corrected measures of response bias. 

l. Since only half the words on the sub- 
ject's list are presented, he may increase hi$ 
probability of chance accuracy by using these 
Presented words in his guesses. The prob- 
ability of chance accuracy is the joint prob- 
ability of (a) accuracy when words in the 
presented group are used in guesses, and ( ) 
choice from the presented group. 

P (chance accuracy) = 1/5 P (choice from 
presented group) 


»-number of words in the presented 
group. 

The probability of choice from the pre 
sented group is empirically determined fro 
responses to blank slides. 

2. The expected number of chance accura- 


cies (4) when a neutral word is actually 
Presented is 


NP (chance ace ; 

М = number of guesses to neutral words, 

3. However, the expected value of © 
(сһапсе accuracy) will also be increased В 
4') bya Propensity to choose neutral wor’ 


uracy) = 4 
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TABLE 1 


INSIGNIFICANT RESPONSE Bias 


Response group for bias measure 


Responses to blank slides 

All errors 

Errors naming stimuli never presented 

Responses to unrecognized neutral words 

Responses to unrecognized emotional 
words 

All errors, corrected 

Responses to unrecognized neutral words, 
corrected 

Responses to unrecognized emotional 
words, corrected 


Males А Females t 

+3.44 .404 — 2.88 514 
+6.68 1.462 — 3.60 .651 
+7.28 1.000 — 14 .016 
4-6.92 ‚739 + 1.16 138 
+5.78 1.067 —10.72 1.630 
+6.38 1.837* — 3.60 724 
+6.78 1.187 + 1.30 7 

+6.06 1.338 — $48 1.601 


* p <.10 using a two-tailed test, the only such result. 


Notc.—Bias equals percentage emotional response less 
he effects of chance accuracy. 


females, “Corrected measures were corrected for t 


for which the amount of response bias 
Was mot statistically significant for 
either sex on any measure, i.e., there 
was little reason to expect a great 
amount of shared variance due to re- 
sponse bias. Bias demonstrated in re- 
Sponses to blanks and bias in re- 
sponses to unrecognized words correlate 
(Pearson r —.53, p <.001). Bias in 


(ie, by response bias). A correction for 
chance accuracy, adjusted for response bias, 
is, 

NP (chance accuracy) +¢ NP (chance 
accuracy) = A’. 

c — P (neutral to blank) — .50 

Here, "c" is the proportion of neutral re- 
sponses given when blanks are presented less 
50—the proportion suggesting no response 
bias. Such an adjustment is appropriate 
whether or not the subject’s bias is greater 
than .50. It will appropriately reduce his 
NP (chance accuracy) should he tend not 
to give neutral responses. 
a 4. The corrected number of ncutral guesses 
© neutral stimuli. NN (corrected), is the 
number of inaccurate neutral responses 
(NN) to neutral stimuli plus the number 
accurate by chance (4'). 

NN (corrected) = ММ + A' 
ee. grec correction did not make sta- 
DM y significant changes in response-bias 
accu Measures unaffected by chance 
bees. cy indicate this was as it should have 


percentage neutral response averaged for 26 males or 


responses to unrecognized stimuli and 
bias in choices from paired words also 
correlate significantly (r=.26, p 
<.05), although at the lower level ex- 
pected because of the differences in 
stimulus conditions and instructions, 
which stress accuracy in the first case 
and response preference in the second. 
Bias in responses to blanks and that in 
choices have a correlation of .22 (.231 
is significant at р <.05). Further in- 
formation on the agreement of meas- 
ures is given in Table 1, which shows 
the mean for each measure. 


Results 

Response Bias. Although stimuli, 
subjects, and tachistoscopic settings 
were chosen with the intent of obtaining 
perceptual defense, the response bias 
one might expect was absent. On all 
measures obtained during stimulus 
presentation, response bias was both 
slight and statistically - insignificant 
(Table 1). Only bias in all errors, 
corrected for chance accuracy, ap- 
proached being significantly different 
from zero. This is true only for per- 
centage emotional response less per- 
centage neutral response for males 


(+6.38%, t= 1.837, p<.10). One 
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TABLE 2 
SIGNIFICANT PERCEPTUAL DEFENSE 
M t 

Males —19.24 2.077** 
Females +17.34 2457** 

Sex 36.58 2.982*** 

** p <.05. 

жер <0. 


Note.—Perceptual defense is percentage Брита 
sp 55 d al response averaged for 
response less percentage neutr: ise а 
the accurate responses of 26 males or females. Prob- 
abilities are two-tailed. 


such approach to significance would 
probably be expected in any study 
yielding over 10 means. The difference 
between sexes is insignificant (¢ = 
1.547). Free choices from paired, 
clearly presented stimuli also fail to 
demonstrate significant bias. Mean 
differences were —.34% for males, 
+4.10% for females. Neither value 
differs significantly from zero. The 
difference between sexes is statistically 
insignificant (# = 765), 

Trends in the data suggest that males 
preferred emotional words. A female 
preference for neutral words is sug- 
gested by (a) the measures suggesting 
the most bias (responses to unrecog- 
nized emotional words), (5) the meas- 
ure based on the most responses (all 
errors), and (c) most of the bias 
measures, 

Perceptual Defense. According to a 
response-bias hypothesis, significant 
perceptual defense should be absent and 
any trends which are present should 
show males recognizing emotional 
words more easily and females recog- 
nizing them less easily than the neutral, 
control stimuli. However, significant 
perceptual defense is present for hoth 
males and females (who show a sex 
difference like that demonstrated hy 
Brown). 

For both males and females, the 
stimuli which are recognized most 
readily are the opposite of those pre- 
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dicted by a response-bias hypothesis. 
Of the four predictions possible, a re- 
sponse-bias hypothesis led to the one 
which is incorrect for both groups of 
subjects. 

A response-bias hypothesis also sug- 
gests that bias in response to blanks 
(which do not restrict bias) will be 
equal to or greater than perceptual de- 
fense shown by responses to stimuli 
(which do restrict bias). Such an ex- 
pectation follows directly from the no- 
tion that it is the characteristics of 
responses, rather than the emotion- 
arousing properties of stimuli, which 
cause perceptual defense; its test uses 
the response-bias most frequently men- 
tioned in previous discussion and re- 
search (Goldiamond, 1958, 1960 ; Gold- 
stein, 1962). However, because of the 
results of Mathews and Wertheimer 


(1958) an opposite prediction was 
made. Perceptual defense was ex- 
pected to be “over-and-above” re- 


sponse bias. As predicted, perceptual 
defense proved significantly greater 
than response bias, both for males (t = 
1.734, р <.05) and for females (t= 
2.010, p <.05). In general, perceptual 
defense is greater than response bias ОП 
any measure. 

Finally, the response-bias hypothe- 
sis implies a prediction of the directio" 
of threshold differences. The suppres- 
sion of socially unacceptable responses 
or the omission of unexpected responses 
could produce a false impression 0 
“perceptual” defense predictable fro™ 
response bias measures, However, ЇЇ! 
this study, response-bias and perceptue 
defense means generally differ in direc 
tion. For individual subjects, predic- 
tion of the direction of а recognitio" 
threshold difference from bias in Te 
sponse to blanks is wrong for 31 of "n 
52 subjects, Only 26 errors would Р 
expected by chance, 

None of the implications of а agi 
sponse-hias hypothesis are ирро! 


re- 


) 
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by the study. In fact predictions made 
on the basis of pure chance would usu- 
ally be successful more frequently. 


Towarp A More COMPLETE CONCEP- 
TION OF ACCURATE RESPONSE TO 
EworioN-AnovsiNG STIMULI 


This article has criticized the over- 
simplification of definition and design 
found in early studies of perceptual 
defense and in more recent studies of 
a response-bias hypothesis. 

When response bias and perceptual 
defense were measured nearly simul- 
taneously, the implications of the re- 
Sponse-bias hypothesis proved more 
wrong than right. Although subjects 
and stimuli were selected by objective 
criteria designed to increase the likeli- 
hood of perceptual defense (and it did 
prove present) response bias was ab- 
sent. Both individual and group per- 
ceptual defense tended to be in a direc- 
tion opposite to that which would be 
predicted by the hypothesis. 

Further evaluation of other response- 
bias and perceptual defense studies will 
Support these findings and suggest the 
need for a more complete conception 
Of response tendencies in the per- 
Ceptual defense experiment. 


Response Bias 


A serious, programatic attack on the 
Measurement of response tendencies 
has heen provided by Goldstein and his 
associates (Goldstein, 1961, 1962; 
Goldstein, Himmelfarb, & Feder, 1962; 
Goldstein & Ratleff, 1961). 

One study (Goldstein, 1961) indi- 
Cates a relationship between associative 
reaction time and the subject’s anxiety 
level, independent of the complexity of 
the habit hierarchy to the word. This 
finding, like the findings of Sarason 
(1959) and others, supports the use of 
а word association test as a technique 
for the selection of emotion-arousing 
Stimuli. Of course, many artifacts 


must be avoided (e.g., the contribution 
of the subject's mean reaction time, 
general familiarity of the word, famili- 
arity within an appropriate norm 
group, the presence of words all sub- 
jects prefer not to discuss, etc.). 

Using a word association test, Gold- 
stein (1962) selected emotion-arousing 
words for which the subject's associa- 
tive reaction time was longer than for 
structurally similar neutral words of 
equal frequency in English. Subjects 
were randomly assigned to (a) a group 
to which blank slides were repeatedly 
presented, (b) a group to which se- 
lected stimuli were very rapidly pre- 
sented, or (c) a group to which stimuli 
were presented with regularly increas- 
ing clarity. All groups showed a bias 
against selected "emotional" stimuli. 
For the blank slide group, bias was not 
significantly less than for the group re- 
peatedly presented the selected neutral 
and emotional words. In this compari- 
son, male and female results were 
averaged. 

Superficially inspected, the findings 
conflict with those reported by Blum 
(1955), Mathews and Wertheimer 
(1958), Minard (1963), and Nelson 
(1955). All report perceptual defense 
greater than response bias. 

More carefully inspected, it is evi- 
dent that Goldstein's study really serves 
as a useful critique of certain concep- 
tions of perceptual defense. 

Unpleasant and socially unacceptable 
stimuli were allowed among the selected 
emotion-arousing words and a bias 
against them is evident in the data. 
This significant response bias serves to 
invalidate the measure of perceptual 
defense. А 

This is probably especially true for 
females, who avoid saying the emo- 
ig words under all experi- 
mental conditions, although females 
recognized highly emotion-arousing 
words especially readily in studies by 


tion-arousit 
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Brown (1961) and Minard (1963). 
Even in Goldstein’s data, mean female 
accuracy to emotional words is greater 
than mean male accuracy, although 
their “pseudo accuracy” (to blank 
slides) is significantly less. A pseudo 
accuracy was a correspondence of the 
subject’s response with a word on the 
list used in the stimulus presentation 
session. The results for accurate re- 
sponses of the male and female groups 
fail to predict the significant difference 
in pseudo accuracy to blank slides, 
since the sex differences go in opposite 
directions in the two groups of re- 
sponses. Such a differential effect 
suggests that response bias interacts 
with stimulus characteristics to deter- 
mine the probability of accurate re- 
sponses when stimuli are actually 
presented. 

Nevertheless, the procedures used 
follow directly from a popular (but 
restricted) definition of perceptual de- 
fense and from the implications of a 
response-bias hypothesis. Goldstein’s 
study warrants comment not because its 
procedures are inferior to those typi- 
cally used, but because they are un- 
fortunately frequent and often not so 
well thought out or reported. 

Averaging across subjects differing 
in sex and personality and across stim- 
uli differing in emotion-arousing prop- 
erties might be expected to average out 
individual differences in perception, 
leaving, as a residual, verbal response 
biases generally Supported in the cul- 
ture. 

Altered Experimental Situations, 
The influence of response tendencies on 
perceptual defense has also been stud- 
ied by changing the responses required 
of subjects. Results have been given 
perceptual-defense interpretations (e.g., 
Blum, 1954; Nelson, 1955) and have 
also been used to question them (Noth- 
man, 1962; Postman, Bronson, & 
Gropper, 1953). Refutations of per- 


Minarp 


ceptual defense have typically used 
socially unacceptable stimuli. The al- 
most certain presence of response bias, 
when public report of such stimuli 15 
required, obscures the meaning of Te- 
sults. For any stimuli, a change in 
instructions may well change the GE 
perimental situation, e.g., it may change 
the effective level of emotion aroused 
by the stimulus. Changes in the event 
to be reported may change the defini- 
tion of recognition, e.g., when the sub- 
ject states which stimulus was “more 
clear” or “stood out the most” his 
task may not involve recognition of the 
whole stimulus at all. It is interesting 
that on very brief presentation d 
tion-arousing stimuli are described 4% 
"more clear," whereas on longer р 
sentation they are not (Mattson © 

Natsoulas, 1962). However, the early 
clarity of details of emotion-arousins 
stimuli might have an effect which 
either helps or hinders in the process 
of stimulus recognition, Understand- 
ing the effects of changes in the exper 
mental situation may require the simu” 
taneous measurement of perceptua 
defense, response bias, and “stimulus 
clarity.” At present, changes cause 
by altering the experimental situation 
do not contribute greatly in understanc 

ing the contribution of response bias = 
perceptual defense, but they might co™ 

Plicate the task of this review. There- 
fore, when dealing with the relation ° 
response bias to perceptual defens? 
this review uses perceptual defensi 
studies (or parts of studies) in ee 
subjects were instructed to achieve 4 o 
curate stimulus recognition (i.e a 
name the presented stimuli or to пате о 
stimulus and its location if stimuli 2 

Presented simultaneously). 


Perceptual Defense 


: ad 

Mathews and Wertheimer ae 
perceptual defense greater than Hy 
sponse bias in subjects high on the ^- 
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scale of the MMPI. Their study 
serves as an example of several studies 
similar in certain respects: All have 
(a) measured perceptual defense and 
response bias nearly simultaneously, 
(b) used responses naming presented 
stimuli as at least one measure of “per- 
ceptual defense contaminated with re- 
sponse bias," (c) used responses nam- 
ing unpresented stimuli (expected by 
the subject) as at least one response- 
bias measure, (d) provided subjects 
with a small sample of possible re- 
sponses from which they must try to 
make an accurate forced choice, and 
(е) used repeated stimulus presenta- 
tions at a level allowing a better-than- 
chance probability of accurate response 
rather than an ascending method of 
limits terminated by one accurate re- 
sponse. These studies include those of 
Blum (1955), Mathews and Wert- 
heimer (1958), Minard (1963), and 
Nelson (1955). 

All have found evidence for per- 
ceptual defense, over and above the 
response bias which may contaminate 
а measure of perceptual defense. This 
independence of perception and indi- 
cator response may be present in other 
studies as well, but obscured by a 
failure to measure response bias or to 
control for its effects on perceptual 
defense measures. 

The studies provide replications 
which check on possible artifacts and 
Suggest generality for the findings. For 
example, Nelson (1955) replicated 
Blum (1955) but equated stimuli more 
carefully, Both simultaneously pre- 
sented Blacky pictures in groups of four 
and asked subjects to name stimuli and 
their locations. Goldstein, Himmel- 
farb, and Feder (1962) have demon- 
Strated “stimulus-location” bias (with 
Words) but Nelson attempted to con- 
trol for this with a preexperimental 
Presentation of stimuli. 

he author extended the results of 


Mathews and Wertheimer (1958) to 
accurate responses by groups differing 
insex. The following reanalysis of por- 
tions of his data provided more direct 
replication and support. The differ- 
ence between percentage of neutral re- 
sponse and percentage of emotional re- 
sponse was obtained for responses 
naming words in the group presented 
(excluding a group of grossly inac- 
curate stimulus response combinations 
so defined before the experiment). A 
reliability coefficient (correlating half 
these responses with the remainder of 
them) was computed for the 52 males 
and females of this study. It was .38 
(p <.005), suggesting a moderate but 
significant reliability despite the pres- 
ence of both response bias and stimu- 
lus recognition, Using a nonparametric 
Wilcoxon T (Siegel, 1956) a signifi- 
cant difference was found between this 
measure and bias in responses to blank 
slides for both males (22.68%, p <.01) 
and for females (20.32%, p <.05). 
Within the sample of responses to 
blanks, the difference between bias in 
responses naming presented words and 
bias in responses naming unpresented 
words was not significant for males 
(t = .156) or females (t= 381). Be- 
cause of the nature of the measures 
used in both the Mathews and Werthei- 
mer study and its replication, the 
“contamination” by response bias could 
not in itself raise the perceptual defense 
measure above the level of response 
bias measured independently. It was 
the characteristics of presented stimuli, 
not of responses, which produced per- 
ceptual defense over and above sub- 
jects’ response bias. 


Conclusions 

An overall interpretation. may be 
made on the basis of response-bias 
studies such as that by Goldstein 
(1962) and perceptual defense studies 
such as that reported here. The prob- 
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ability of an accurate response to a 
stimulus may be a result of the inter- 
action of (a) bias against certain rc- 
sponses and (b) emotion-arousing 
properties of the presented stimuli. By 
appropriate experimental design, the 
effect of either variable may be greatly 
reduced. Jt is an interesting possibility 
that, due to differences in operant re- 
sponse conditioning or to conditioned 
emotional behavior, the effects of these 
variables may differ from person to 
person. Most important, in the light of 
current controversy, is the evidence 
that perceptual defense does occur, 
independent of the nature or presence 
of response bias, 
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A. model for short-term memory is described and evaluated. A variety 


of experimental data are shown 
statements. 


gotten because they are interfered w 
not because their traces decay in time. 
item from a very limited primary memory s 
(c) A recently pe 
time. 
be separated by experimental and 


stable secondary store. 


tained in both stores at the same 
independent memory systems can 


analytical methods. 


It is a well-established fact that the 
longest series of unrelated digits, let- 
ters, or words that a person can recall 
verbatim after one presentation seldom 
exceeds 10 items. It is also true, how- 
ever, that one can nearly always recall 
the most recent item in a series, no 
matter how long the series—but only 
if this item may be recalled immedi- 
ately, or if it may be rehearsed during 
the interval between its presentation 
and recall, Otherwise it is very likely 
to be lost. If we may assume that 
attending to a current item precludes 
Teviewing a prior one, we can say that 
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be consistent with the following 


(a) Unrehearsed verbal stimuli tend to be quickly for- 


ith by later items in a series and 
(b) Rehearsal may transfer an 
tore to a larger and more 
rceived item may be re- 
The properties of these 2 


the span of immediate memory must 
be limited in large part by our inability 
to rehease, and hence retain, the early 
items in a sequence while attempting to 
store the later ones. Our limited mem- 
ory span would then be but one mani- 
festation of our general inability to 
think about two things at the same 
time. 

Why should an unrehearsed item 
in a list be forgotten so swiftly? Is its 


physiological trace in some sense writ- 


ten over by the traces of the items 
is trace sim- 


that follow it? Or does | | 
ithi ief interval, re- 
ply decay within a brie interval, 
gardless of how that interval is filled? 
Tradition, in the guise of interference 
theory, favors the first explanation 
(McGeoch, 1932; Postman, 1961), al- 


though some psychologists now think 
that new memory traces must fade 
autonomously in t 1958; 


ime (Brown, ) 
Сопгай, 1957; Hebb, 1949). Until 
no one has re 


now, ported any data 
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which clearly contradict either of these 
ideas. In fact, when we first consid- 
ered the problem of the instability of 
recent memory traces, we thought it 
entirely possible that both decay and 
interference operate over brief reten- 
tion intervals to produce forgetting, 
and we therefore designed an experi- 
ment to weigh their respective effects, 
The results of this experiment were un- 
expectedly straightforward—and seem- 
ingly inconsistent with certain other 
existing data on immediate retention. 
We have been able, however, to for- 
mulate a simple quantitative model 
which relates our results to those Te- 
ported by other investigators. What 
began as an attempt to evaluate two 
very general hypotheses about the for- 
getting of recent events has therefore 
resulted in a specific theory of short- 
term memory. 

We shall describe our experiment in 
Section I below. A major portion 
of this paper, Section II, will be con- 
cerned with the description and appli- 
cation of our model. In Section ITI we 
shall discuss this model in relation to 
the general question of whether short- 
and long-term retention represent 
distinguishably different psychological 
processes, 


I. Pnosr-DrciT EXPERIMENT 


а minimally rehearsed 
i of the num- 
f other items 
general pro- 


Lists of 16 single 
digits were prepared with the aid of a 
standard table of random numbers, under the 
hould appear more 
The last digit in every 
list was one that had occurred exactly once 
before, in Position 3, 5, 7, 9, 10, 11, 12, 13, or 
14. On its second appearance, this “Drobe- 
digit” was a cue for the recall of the digit 
that had followed it initially, 

The lists were recorded on two magnetic 
tapes; they were read in a monotone voice 
by a male speaker at a constant rate of 


either one or four digits per second. Each 
of the nine possible probe-digit positions 
was tested 10 times. The two tapes A 
cordingly contained 90 test lists (plus 8 Re 
tice lists) apiece, all read at the same en 
The last digit in every list, the probe-digit, 
Was accompanied by a high-frequency ton 
to aid the subject in detecting the end 9 
the list. The position of the initial fuu. 
tion of the probe varied randomly from Hs 
to list on each of the two tapes. the 
The subject’s task was to write down in 
digit that had followed the probe digit 28 
the list, guessing if he did not know. | Sin 
the probe-digit was unique in Positions j 
through 15, there was only one possible can 
rect answer on any trial. Every suber 
listened to the list through earphones for | 
total of 12 experimental sessions, 6 with aa 
tape, alternating between fast and slow p 
The first session under each condition "€ 
the first eight lists listened to in each d 
sion were considered to be practice and, Ч 
known to the subject, were not scored. | ns 
The subjects received explicit instruction 
to control rehearsal by “thinking only б 
the last digit you have heard and neve ns 
any of the earlier ones.” These instruction 
Were repeated before the second session, P 
Occasional reminders were given throug d a 
the course of the experiment. Thus, the she 
jects were to rehearse every item during it. 
interitem interval immediately following T. 
Our instructions were not designed to € Mid " 
nate the rehearsal of single items as Й 
but rather to eliminate the rehearsa ally 
groups of digits. The experiment actu 
tested the retention of a digit раі, ten- 
Probe-digit and its successor. The Be the 
tion of this pair should be independent © ої 
interitem interval, if the instructions to UN 
Srouping were followed faithfully. 
hoped, in effect, to test the retention es 
unrehearsed pairs of digits under two Г 
of presentation, der- 
The subjects were four Harvard Un 
graduates, three males and one female. d to 
he responses Were scored and amm of 
Yield a seria] Dosition curve for each He cy 
Presentation, relating the relative freq tance 
of an item's correct recall to its die f 
from the end of the list. A comparis© Ja- 
the two functions allows us to assess mes 
tive effects of decay and interferena ol- 
short-term forgetting, according to t recal 
lowing line of reasoning. Consider the f the 
of Item i from the end of the line. per 
list was read at the rate of one ite ene 
second, then i items would have pue 
and i seconds would have elapsed b€ 
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m time the subject heard the item and the 
ше he attempted to recall it. (We count 
re peo appearance of the probe-digit as 
M dodge event.) If the items were 
EN UM UN rate of four рег second, on the 
d and, then only i/4, rather than i, 
Feria would have elapsed between the oc- 
is о: of Item i and the subject's attempt 
Ee it. A total of i other items would, 
i still have intervened between these 
: mne. Therefore, if the probability of 
Sc ing Item i from the end of a slow list 
тава with the probability of recall- 
ies m 4i from a fast list, we could con- 
e i ar recent memory traces decay in 
€ in zee көрд of one another. Con- 
Id i the probability of recalling Item 
pe invariant with rate of presentation, we 
uld conclude that rapid forgetting is caused 
primarily by retroactive interference. 


The results of the experiment are 
shown in Figure 1. The scores for 
the individual subjects are presented 
in Figures 1A and 1B. The pooled 
data, corrected for guessing, are shown 
in Figure 1C.? Each point in Figures 
1A and 1B is based on 50 observations; 
each point in Figure 1C, on 200. It 
is evident that there are consistent dif- 
ferences among subjects, but little in- 
teraction between subjects and serial 
positions. Furthermore, although there 
appears to be a slight interaction be- 
tween relative frequency of recall, or 
R(i), and rate of presentation, it is 
clear that the effect of rate is relatively 
small compared to the effect of serial 
position. The main source of for- 
getting in our experiment was inter- 
ference. 

The differences between the two sets 
9f points shown in Figure 1C are not 
Statistically reliable, according to an 
analysis of variance performed on the 
number of items recalled by each sub- 


mee response | set—the 10 digits—was 
the wn to the subjects, and they knew that 
digit oe would not be the same as the test 
ing £1 Thus the probability of correctly guess- 
malizi, answer, g, was 1/9. A standard nor- 
Sects o technique was used to eliminate the 
P(r S of guessing from the data, namely, 
ecall) = [p(correct) — g]/(1 — 9). 


R(i) 


в) 


1.0 [ 
b~o RATE 
eo 
usb N о | PER SECOND 
S © 4 PER SECOND 
r \ --- AVERAGE 
\ 
Leer "n 
= f x 
04 


NUMBER OF INTERFERING ITEMS (i) 


Fic. 1. Results of the probe-digit experi- 
ment. (Figures 1A and 1B represent re- 
tention functions for jndividual subjects 
under two rates of presentation ; in Figure 
1C these data have been pooled.) 


ject at each value of 7 under the two 


rates of presentation (F <1 for the 
mean square between rates tested 
against the interaction. between sub- 
jects and rates). This conclusion is 
borne out by the results of nine Kol- 
mogorov-Smirnov two-sample tests, 


one for each value of i, performed 


on the distributions of number of items 


recalled per subject per session under 
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the two rates of presentation. We 
have therefore fitted the points shown 
in Figure 1C with a function that 
represents the probability of recalling 
Item i from the end of a series, esti- 
mated across rates of presentation. 
This function decreases monotonically 
with i, attaining a value of about .07 
at i = 12. 


П. MODEL ror Primary Memory 


When we compared the foregoing 
results with the typical outcome of the 
first trial in a standard list-learning 
experiment, we found ourselves facing 
two dilemmas. In the first place, it 
often happens that an item in a long 
list is recalled after 10 or 20, or even 
more, items have followed it. But in 
our experiment, probability of recall 
was effectively zero for the eleventh 
item in from the end of a list. In the 
second place, various investigators have 
shown that probability of recall in- 
creases with presentation time (see 
Posner, 1963), yet in our experiment 
this probability, for all practical pur- 
poses, was independent of the rate at 
which the digits were read. 

In seeking for a way to account for 
these discrepancies, it occurred to us 
that one difference between our ex- 
periment and previous ones in this area 
is that we instructed our subjects not 
to think about any item in a list once 
the next had been presented. This 
instruction to avoid rehearsal is, to be 
sure, rather unorthodox, although not 
completely without precedent (Under- 
wood & Keppel, 1962). In order to 
minimize rehearsal, many experiment- 
ers try to keep the subject so busy that 
he does not have time to rehearse; but 
we think it highly likely that a well- 
motivated subject who is trying to learn 
a list will rehearse unless specifically 
enjoined from doing so. The typical 
subject’s account of how he learns a list 
(Bugelski, 1962; Clark, Lansford, & 


Dallenbach, 1960) bears us out on this 
point. In fact, it is probably very 
difficult not to rehearse material that 
one is trying to memorize. 

We shall assume here that rehearsal 
simply denotes the recall of a verbal 
item—either immediate or delayed, si- 
lent or overt, deliberate or involuntary. 
The initial perception of a stimulus 
probably must also qualify as a T&- 
hearsal. Obviously a very conspicu- 
ous item or one that relates easily to 
what we have already learned can be 
retained with a minimum of conscious 
effort. We assume that relatively 
homogeneous or unfamiliar materia’ 
must, on the other hand, he deliberately 
rehearsed if it is to be retained. Ae 
tually, we shall not be concerned here 
with the exact role of rehearsal in m 
memorization process, We are simply 
noting that, in the usual verbal-learn- 
ing experiment, the likelihood that at 
item in a homogeneous list will be д 
called tends to increase with the amoun 
of time available for its rehearsal. The 
probe-digit experiment has shown, co!" 
versely, that material which is not 1 
hearsed is rapidly lost, regardless ja 
the rate at which it is presented. It s 
as though rehearsal transferred a T° 
cently perceived verbal item from One 
memory store of very limited барас 
to another more commodious Vig 
from which it can he retrieved at * 
much later time. T 

We shall follow James (1890) m 
using the terms primary and second? 
memory (PM and SM) to denote ae 
two stores. James defined these get 
introspectively : an event in PM F 
never left consciousness and is part M 
the Psychological present, while ре 
event recalled from SM has been ? 
sent from consciousness and belong. he 
the Psychological past. PM is a fa! 1 
ful record of events just perceived ; mes 
is full of gaps and distortions. io 
believed that PM extends over a 1 
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period of time. We propose instead 
that it encompasses a certain number 
of events regardless of the time they 
take to occur. Our goal is to distin- 
guish operationally between PM and 
SM on the basis of the model that we 
shall now describe. 

Consider the general scheme illus- 
trated in Figure 2. Every verbal item 
that is attended to enters PM. As we 
have seen, the capacity of this system 
1s sharply limited. New items dis- 
Place old ones; displaced items are 
permanently lost. When an item is 
rehearsed, however, it remains in PM, 
and it may enter into SM. We should 
like to assume, for the sake of sim- 
Plicity, that the probability of its en- 
tering SM is independent of its posi- 
tion in a series and of the time at which 
lt is rehearsed. Thus, it would not 
Matter whether the item was rehearsed 
Immediately on entering PM or sev- 
ега] seconds later: as long as it was 
in PM, it would make the transition 
into SM with fixed probability. (Our 
PM is similar to Broadbent's, 1958, P 
System, One difference between our 
two systems is that ours relates re- 
hearsal to longer term storage, whereas 
his does not.) 
| Finally, we shall assume that re- 
Sponse-produced interference has the 
Same effect on an item in PM as does 
iu mulus-produced interference. That 

; the probability that an item in PM 
m be recalled depends upon (a) 
d T new items have been per- 
e] plus (b) how many old ones 
idi Deen recalled between its presen- 
ái On and attempted recall Thus, if 

1 item appears in Position from 
ies ie of a list and the subject at- 
other v to recall it after recalling m 
айва i it is as if the item had 
ist f ed in position 7 — n + m in the 
end E recall was attempted at the 
таео the list. This assumption is 

Strong, but recent studies by 


SECONDARY 
MEMORY 


PRIMARY  [RERERRSRL / 


MEMORY 


FORGOTTEN 


Fic. 2. The primary and secondary mem- 
ory system. (All verbal items enter PM, 
where they are either reheased or forgotten. 
Rehearsed items may enter SM.) 


Murdock (1963) and by Tulving and 
Arbuckle (1963) have, in fact, failed 
to reveal any consistent differences be- 
tween stimulus- and response-induced 
interference in the retention of paired 
associates. It may not be unreasonable 
ito suppose, therefore, that the two 
sources of interference exert equivalent 
effects on free and serial recall. 

According to our hypothesis, then, 
the probability of recalling an item 
which has been followed by i subse- 
quent items is given by the probability 
that it is in PM, in SM, or in both. 
Assuming that these probabilities com- 
bine independently, 


R(i) = Р(ї) + S() = P(i)SQ) [1] 


where R(i) is the probability that Item 
i will be recalled, P(i) is the probabil- 
ity that it is in PM, and S(i) the 
probability that it is in SM. The 
probability that this item is in PM 
is then given by 
P(i) = [Rù 
n = Sue = 5001. [2] 


We assume that P(i) is а monotonic 
decreasing function ОЁ? and that 

lim P(i) = 0. 
We should like specifically to test the 
hypothesis that P(i) is independent 


of the value of S(i) and, in fact, varies 
with 7 in the manner of the probe-digit 


data. (This hypothesis is stated more 
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formally in the Appendix.) In order 
to do so, we need data on verbal re- 
tention that meet the following re- 
quirements. 


1. They should come from an ex- 
perimental situation where at least 
some of the items are retrieved from 
PM. 

2. The subject should have been al- 
lowed to rehearse, so that S(i)> 0. 

3. The value of S(i) should prefer- 
ably be constant and independent of i. 

4. The experimental lists should be 
long enough to let us estimate S(i) 
for i> 12. 


5. We should know the location of a 
given item in the stimulus list (л) and 
in the recall list (71), so as to be able 
to estimate the total number of inter- 
fering items (i= n + m). 


Free Recall 


The free-recall experiment is well 
suited to our purposes. Subjects can 
(and usually do) recall the last few 
items in a list right away, and the 
middle portion of the serial position 
curve (after the first three and before 
the last seven items) is effectively flat, 
thereby providing a convenient esti- 
mate of S(i) (Deese & Kaufman, 
1957; Murdock, 1962; Waugh, 1962). 

Testing our hypothesis against data 
collected in a free-recall experiment 
therefore involves the following steps: 


1. First, we estimate S(i) from the 
average proportion of items recalled 
from the middle of a long list. 

2. We then estimate P(i) for each 
of the last seven items in the list by 
Equation 2. 

3. We plot this estimate against 
n-+ m = and compare the resulting 
function with that shown in Figure 1. 


Fortunately, we did not have to per- 
form a free-recall experiment especially 
for this purpose: severa] such studies 


have been carried out and reported 
in sufficient detail to enable us to test 
our hypothesis against their results. 
We have chosen to analyze four sets of 
data collected by three different in- 
vestigators: Deese and Kaufman 
(1957), Murdock (1962), and two as 
yet unpublished experiments conducted 
by Waugh. The two principal vari- 
ables that affect S(i) in free recall 
appear to be length of list (the amount 
of material that is to be retained) and 
presentation time (the amount of time 
available for the rehearsal of a given 
item). Manipulating these variables 
results in orderly changes in the value 
of S(i), so that our estimates range 
from .08 to .45 across the four experi- 
ments. 

1. In Deese and Kaufman’s study, 
the subjects listened to lists of 32 un- 
related English words read at a rate 
of one per second, and began recalling 
them immediately after the last had 
been spoken. Deese and Kaufman 
have presented a serial position curve 
based on these data and have also d 
ported the relation between an item 5 
Serial position in recall and its posi- 
tion in the original list. We can 
thereby estimate i for each item in 
their lists, letting an item's averagt 
position in recall be our estimator ° 
the amount of response interference 
(т). We estimated 5 (i) by the pro- 
portion of items recalled after the first 
three and before the last seven seria 
positions in the original list: (Th! 


3 It is not really correct to use the averag^ 
of the serial positions in recall as an eso 
of m+1: the total effect of response ns 
ference should depend on the variance of t = 
distribution as well as on its mean or 5 
dian. It is the only alternative open to Е 
however, since our correction for asymptor 
must be applied to the average proportia 
of items retained, estimated across 507 
position in recall. re- 
4In estimating S(i), we ignored the use 
call of the first three items on a list Беса! 
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same general procedure will be fol- 
lowed in our subsequent analyses.) 

The last seven points of Deese and 
Kaufman’s serial position curve, taken 
from their Figure 1 and corrected for 
asymptote according to Equation 2, 
are plotted as a function of i in Figure 
3. The dashed lines in Figure 3 rep- 
resent the 99% confidence limits for 
the probe-digit function: a standard 
error for each point was estimated 
across subjects and experimental ses- 
sions. The uncorrected data are shown 
in Table 1. 

2. Waugh’s experiments were con- 
cerned with determining the number 
of items freely recalled from long lists 
as a function of presentation time. In 
her first experiment, 24, 30, 40, 60, or 
120 different monosyllabic English 
Words were read to the subjects at a 
Tate of one per second. The propor- 
tion of items recalled varied inversely 
With list length, so that for each length 
Of list there is a different serial posi- 
tion function. The asymptotes of 
these functions range from approxi- 
mately 08 to .20. Median serial posi- 
tion in recall (m + 1) was calculated 
for each of the last six items in a list; 


fer invariably show a primacy effect, per- 
aps the result of selective attention and 
rehearsal, 
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о DEESE AND KAUFMAN 
PROBE DIGITS 


P(i) 


NUMBER OF INTERFERING ITEMS (i) 


Fic. 3. Free-recall data from Deese and 
Kaufman (1957), corrected for asymptote 
and response interference. 


Figure 4 shows S(i) as a function of 
i for each of these items. The uncor- 
rected data appear in Table 2. 

In Waugh’s second experiment, the 
subjects listened to 30 different words 
presented at a rate of 1, 2, 3, 4, or 6 
seconds per word. In each case the 
presentations were either massed—that 
is, each word was read one, two, or 
three times in a row, at a rate of one 
word per second or of one word every 
two seconds—or they were distributed 
—each word was read once at one, two, 
three, four, or six different places in 
a list, at a rate of one word per second. 
The results of this experiment indicate 
that whether the repetitions are massed 


TABLE 1 


Proportion Or ITEMS FRE 


ок SERIAL POSITION AND TOTA. 


ELY RECALLED AS A FUNCTION 


L TIME PER LIST 


| 


X seconds per item 


Number of List length 
ems 32 X1 40 X1 20 X2 30 X1 15 X2 20 X1 10 X2 
0 E .96 .95 97 97 55 35 
1 67 85 88 `89 88 ^ 83 
A 60 71 115 14 80 76 n 
3 42 51 E 52 (62 6 5 
4 .32 .40 43 39 58 39 58 
5 27 27 38 133 49 30 45 
б 22 22 38 24 42 26 45 
es EU ‚12 27 19 38 . | 
N 
SOUS Deese and Kaufman (1957), Column 1; Murdock ( 1961), Columns 2-6, 


Dtrieg 


in this row represent the asymptotic value of RD). 
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WAUGH 


LIST LENGTH PROBE DIGITS 


NUMBER OF 


INTERFERING ITEMS (i) 


Fic. 4. Free-recall data from Waugh 
corrected for asymptote and response inter- 
ference. 


or distributed is of no importance ; 
the probability that a word will be re- 
called is determined simply by the total 
number of seconds for which it is 
presented. Since this probability in- 
creases as a negatively accelerated func- 
tion of presentation time, the asymp- 
totic values of the serial position 
function obtained in this experiment 
ranged from approximately .14 (for 
30 words each read once) to .45 (for 
30 words each read six times). Aver- 
age serial position in recall was again 
calculated for each of the last six items 
in a list. The retention functions for 
massed and distributed repetitions, cor- 


TABLE 2 
PROPORTION OF ITEMS FREEL 


Y RECALLED As 
A FUNCTION or $ х 


ÍULUS INTERFE 


Number of 

intervening 
items 24 30 40 60 120 
0 95 .97 1.00 95 1.00 
1 85 85 90 93 95 
2 92 69 .81 .86 92 
3 42 46 51 „53 57 
4 47 35 ЗІ 32 57 
5 21 17 22 14 14 
Ss 15 17 .16 12 08 


a Entries in this ri 


ow represent the asymptotic value 
of R(n), 


rected for asymptote and response in- 
terference, are shown in Figures 5 and 
6, respectively, along with the. PM 
function obtained in our probe-digit ex- 


periment. The uncorrected data are 
shown in Table 2. 
3. In Murdock's experiment, the 


subjects listened to lists of 20, 30, or 
40 words read at a rate of 1 word 
per second and to lists of 10, 15, and 
20 words read at a rate of 1 word 
every 2 seconds. Murdock found. as 
has Waugh (1963), that the probabil- 
ity of recalling a word that has been 


WAUGH 
Lor. CONDITION 
E A * DI 
NOD o D2 
0.8 NR g 25 
RES 4 
X X m 010175 


NUMBER OF 


items (i) 


INTERFERING 


r- 
Fic. 5. Free-recall data from Waugh va 
rected for asymptote and response inter 


"ES а ег 
ence (1-6 distributed presentations P 
word). 


listened to for 2 seconds is almost 
exactly twice the probability of T° 
calling a word that has been listened 1 
for 1 second. Murdock’s data. d 
therefore be grouped into three pairs 
serial position curves: 10 words ni 
at a rate of 1 every 2 seconds ve! ЕЕ 
20 words read at a rate of 1 per 5€ y 
ond ; 15 words read ata rate of 1 ae 
2 seconds versus 30 read at a rate a 
1 per second; and 20 words read i“ | 
rate of 1 every 2 seconds versus hin 
read at a rate of 1 per second. pron 
each pair, there are two аѕутріо е 
one of which is approximately twice 
value of the other, 
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We have corrected Murdock’s curves 
for asymptote—that is, for S(7)—and 
since he did not calculate serial posi- 
tion in recall for his words, we have 
plotted these corrected values of P(i) 
against the average values of i cal- 
culated by Waugh for words recalled 
under similar conditions in the experi- 
ment just described (see Figures 5 
and 6). Murdock’s uncorrected data 
are shown in Table 1. 

It is clear that an appreciable num- 
ber of the points displayed in Figures 
3 through 7 fall outside the confidence 
limits we have set for the probe-digit 
function. In general, the discrepancies 
between theoretical and observed values 
of P(i) appear to be unsystematic. 
They may have resulted from either 
Of two possible sources which would 
not be reflected in the variance of the 
probe-digit function. 

In the first place, we assume that 
S (i) is constant for all i While S(i) 
does not in fact seem to vary system- 
atically with i in the middle of a list, 
individual words do differ greatly in 
their susceptibility of storage in sec- 
Ondary memory: the serial position 
function for free recall is haphazardly 
Jagged rather than perfectly flat. Thus, 
€ven one anomalously easy word in 
Location n, for instance, can greatly 
Inflate our estimate of R(n) and hence 
P(n). The probe-digit data would 
Presumably not be subject to this kind 
of variability. 

A second source of errors may lie in 
Sur estimation of i, or m+n. We 
have used average position in recall— 
call it т --1—as our estimate of m 
+1. Even a small error in this esti- 
Tate can lead to a sizable discrepancy 
эш а theoretical and an observed 
ier of P(i), especially around the 

D early portion of the function. 

ë The 
We 
lat 


$e asymptotes for Murdock's curves 
ed Obtained by complementing his tabu- 
values for v (shown in his Table 2). 


WAUGH 
1.0 CONDITION 
* м2 
a M3 
А M4 


v M6 
жаш PROBE DIGITS 


ITEMS (1) 


NUMBER OF INTERFERING 


Fic. 6. Free-recall data from Waugh 
corrected for asymptote and response inter- 
ference (2-6 massed presentations рег 


word). 


Errors of this sort would be reflected 
in Figures 4-7, where i and P(i) are 
derived from either partially or com- 
pletely independent sets of data (in 
Figures 4-6 and Figure 7, respec- 
tively). Furthermore, we should in 
any case expect some discrepancy on 
purely mathematical grounds between 
P(i), where (i) is the mean of a point 
distribution, as in the probe-digit ex- 
periment, and P(m- л), where ш 
can assume any of a number of values, 
as in the free-recall data we have 
analyzed. Unfortunately, we are un- 


MURDOCK 
LIST SEC/ITEM 
LENGTH 


e 40 1 
o 30 1 
а 20 1 
x 20 2 
А 15 2 
+ 10 2 

—-- — PROBE DIGITS 


1 3 5 7 9 


NUMBER OF INTERFERING ITEMS (i) 


Fic. 7. Free-recall data from Murdock 
(1961), corrected for asymptote and response 


interference. 
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able to specify the magnitude of this 
expected discrepancy. : 

In view, therefore, of the likelihood 
of the errors we have just described, 
we believe that the fit between the 
probe-digit function and the free-recall 
data is fairly good and is, in fact, 
probably too close to attribute to 
chance. Actually, in one respect it is 
surprising that the probe-digit func- 
tion should describe the free-recall data 
as well as it does. The probe-digit 
experiment tested the retention of 
digit pairs, whereas the free-recall ex- 
periments tested the retention of in- 
dividual items. How are we justified 
in equating the two? One possibility 
is to assume that in the probe-digit 
experiment the subjects perceived and 
stored the digits as a series of over- 
lapping pairs, rather than as single 
digits. In this case, the measure of 
interference would be given by the 
number of digit-pairs that follow any 
given pair, which is, of course, equal to 
the number of single digits that follow 
it. In the free-recall experiment, on the 
other hand, the subjects may have per- 
ceived the words as independent units, 
and the effective interference would 
then consist of single words, as we have 
in fact been assuming. The problem, 
then, can be restated as follows: why 
do pairs of digits and single words 
exert equal amounts of retroactive in- 
terference on like items in primary 
memory? There is little in the exist- 
ing literature that sheds much light on 
this point. 


Paired Associates 


Our model should, of course, be able 
to describe ordered as well as free re- 
call. We face serious problems, how- 
ever, in attempting to apply it to 
serial learning : if a list is long enough 
to furnish a stable estimate of S (1), 
the probability that a given item will 
be in PM at the time of testing is negli- 


gible, since serial items are custom- 
arily tested in the order in which they 
were presented. We must therefore 
turn to paired associates. In a recent 
study, Tulving and Arbuckle (1963) 
systematically varied the positions of 
the items on the recall list, and we 
have therefore applied our hypothesis 
to their data in the manner described 
above. 

Tulving and Arbuckle presented 
number-word pairs to their subjects 
and then tested for the recall of each 
word by presenting only the number 
with which it had been paired. They 
were interested in measuring probabil- 
ity of recall after one trial as a function 
of an item's serial position in both the 
original list and the test list, We have 
estimated S(i) by averaging the recall 
probabilities for i> 13, excluding 
Items 1 and 2. The value of their 
serial position curve is fortunately con- 
stant in this region, as it was for free 
recall. Note that in this task, each 
pair presented after a given number 
and before the cue for its recall actually 
consists of two interfering items: @ 
word plus a number. We have counted 
all items occurring between the test 
item and its recall—including the test 
number—as interfering items. We 
have analyzed the Proportion of item? 
presented in Positions 1 through 
from the end of the stimulus list an 
tested in Positions 1 through 6 of the 
Tesponse list. These proportions are 
shown in Tulving and Arbuckle $ 
Tables 2 and 4; we have pooled those 
that correspond to a given value of t. 
Thus i, or n+ m (where n = f ES 
m=i—j), ranges from 1 to 11. 
These data are presented in Figure he 
along with our own estimate of P (Ù); 
Again, considering the variability ° 
S(i) that is not taken into account 
our model, the fit between data ап 
theory appears to be reasonably 8000: 

In sum, then, we believe we can 52 
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that the similarity between our probe- 
digit function and the various other, 
initially disparate, serial position curves 
shown in Figures 3-8 is consistent with 
the hypothesis that there is a primary 
memory store that is independent of 
any longer term store. The capacity 
of the primary store appears to be 
invariant under a wide variety of ex- 
perimental conditions which do, how- 
ever, affect the properties of the longer 
term store. 


Single-Item Retention 


Much of the experimental work on 
memory in the past 5 years has fo- 
cused on measuring the retention of a 
single verbal item—or of a brief list 
of items—over short intervals. А 
widely used procedure which was in- 
troduced by Peterson and Peterson 
(1959) is to expose an item (for 
example, a meaningless three-letter se- 
quence) to a subject; have him per- 
form some task that presumably mo- 
nopolizes his attention (such as count- 
ing backwards by three's) for a 
Specified number of seconds; and, 
finally, at the end of this interval, have 
him attempt to recall the critical item. 
The universal finding has been that 
retention decreases monotonically with 
the length of the retention interval. It 
has generally been assumed that the 
Subject does not rehearse during the 
retention interval, that a number spoken 
by him does not interfere with a 
trigram previously spoken by the ex- 
Perimenter, and that therefore the ob- 
Served decline over time in the reten- 
tion of such an item reflects the pure 
decay of its memory trace. This gen- 
ега] conclusion is clearly inconsistent 
e our results, since we have found 
s the length. of the retention interval 
E Such—within the limits we tested, 
aturally—is of relatively little im- 
Portance in determining retention loss. 

In seeking for a way to account for 


this discrepancy, it occurred to us to 
question the assumption that, in an 
experiment of the sort described above, 
the numbers spoken by the subject dur- 
ing the retention interval do not inter- 
fere with the memory trace of the item 
he is supposed to retain. Some ex- 
perimenters have, after all, reported 
that dissimilar items seem to interfere 
with one another just as much as do 
similar ones in the immediate recall 
of very short lists (Brown, 1958; Pills- 
bury & Sylvester, 1940). What would 
happen, therefore, if we were to define a 
three-digit number uttered by a subject 
in the course of a simple arithmetic cal- 
culation—counting backwards—as one 
unit of mnemonic interference? Could 
our model then describe the forgetting 
of single items over brief intervals? 
We have attempted to fit the data of 
two experimenters, Loess (in press) 
and Murdock (1961), by converting 
the retention interval into a corre- 
sponding number of interfering items. 
Murdock's subjects were trained to 
count at a steady rate of one number 
per second, so the number of inter- 
fering items in his experiment is equal 
to the retention interval in seconds. 
Loess’ subjects counted at a rate of 
one number every 1.5 seconds; we have 
therefore multiplied the length of his 
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Fic. 9. The retention of three-item lists 
compared with the probe-digit function, 
(Loess' data denote the proportion of con- 
sonant trigrams recalled after various reten- 
tion intervals; Murdock's data represent 
the average proportion of trigrams and 
word triads retained after a given interval.) 


retention intervals by 3 in order to ob- 
tain the equivalent number of inter- 
fering items. We estimate S(i) in 
both cases by the relative frequency of 
recall at i= 18.5 

The two sets of data, corrected for 
asymptote, are shown in Figure 9, 
along with the probe-digit function. 
The correspondence between them is 
reasonably close. It is possible, of 
course, that this agreement between 
theory and fact is simply a matter of 
luck, depending, as it does, on the 
arbitrary assumption that a three-digit 
number generated by the subject him- 
self is psychologically equivalent to a 
one-digit number presented by the ex- 
perimenter during the retention inter- 
val (as in the probe-digit study). 
Obviously we cannot draw any firm 
conclusions about the effect of inter- 
ference on the retention of single items 
until this assumption is justified em- 
pirically. We can only point out that 


6 We have also tried to analyze the results 
of Peterson and Peterson (1959) but with- 
out success. Part of the difficulty may result 
from the fact that their subjects may not 
have adhered strictly to a prescribed rate 
of counting during the retention interval (L. 
Peterson, personal communication, 1964). 
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the results of Murdock and Loess do 
not necessarily contradict our model. 


Discussion 


We should at this point like to con- 
sider the general question of whether 
all verbal information is stored in the 
same system or whether, as we have as- 
sumed here, there are two independent 
mnemonic processes that contribute to 
retention even over very short inter- 
vals. The proponents of a unitary the- 
ory of memory, eloquently led by 
Melton (1963), have argued that re- 
call after a few seconds is affected in 
very similar ways by the variables 
that govern recall over much longer in- 
tervals; and that therefore the distinc- 
tion between a short-term memory 
mechanism, on the one hand, and а 
longer term mechanism, on the other. 
is purely arbitrary. The following facts 
have been cited in support of this argu- 
ment: 


1. Short-term | retention improves. 
just as does long-term retention, when 
the material to be recalled is repeated 
before a test of retention, or when it i5 
repeated — between successive tests 
(Hebb, 1961 ; Hellyer, 1962). " 

2. Retention after a brief delay !5 
subject to proactive interference, as 18 
retention after a long delay (Keppel 
& Underwood, 1962; Loess, in press)- 
Why, asks the unitary theorist, should 
we distinguish between short- and 
long-term retention if we cannot finc 
any quantitative and experimentally ше 
nipulatable differences between ет: 
This question might well be disturbing 
if one took the position that the tw 
processes have sharply defined pe 
overlapping temporal boundaries ENY 
that items recalled within some critic? 
interval after their initial occurrence 
must have been retrieved from опе SY*- 
tem, whereas items recalled beyonc 
this interval must have been retrieve 
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from another. (Such a view would im- 
ply, interestingly enough, that an item 
would have to remain in a short-term 
storage for some specified number of 
seconds before passing into longer 
term storage, if it did so at all.) 


But what if we do not require that 
the two systems be mutually exclusive? 
Then the probability that an item will 
be recalled will depend on both the 
probability that it is still in PM and 
the probability that it has entered into 
SM in the interval between its presen- 
tation and the start of the interfering 
sequence (or even during this sequence, 
if the subject is able to rehearse). All 
those variables that determine S(i) for 
a given item—such as its position in a 
closely spaced series of tests, or the 
number of times it has been repeated 
—will then determine the observed 
proportion recalled after a brief inter- 
val. We believe we have shown, how- 
ever, that P(i) depends only on i and 
remains invariant with changes in 
S(i); and we submit that most of the 
published data on short-term retention 
actually reflect the properties of both 
memory systems. 

We would like to make one final 
point: the existence of some rather 
compelling introspective evidence in 
favor of two distinct mnemonic sys- 
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tems. РМ, as we have defined it here, 
is best illustrated by a person's ability 
to recall verbatim the most recent few 
words in a sentence that he is hearing 
or speaking, even when he is barely 
paying attention to what is being said, 
or to what he is saying. Given that 
the flow of speech is intelligible, failures 
in the immediate recall of words we 
have just heard—errors of either omis- 
sion, transposition, or substitution— 
are probably so rare as to be abnormal. 
Indeed, we believe that it would be im- 
possible to understand or to generate 
a grammatical utterance if we lacked 
this rather remarkable mnemonic ca- 
pacity. In order to recall a sentence 
verbatim at a later time, however, we 
usually have to rehearse it while it is 
still available in PM. 

The same effect holds for meaning- 
less arrangements of verbal items. If 
we present a subject with a random 
string of words, letters, or digits, and 
ask him to reproduce them in any order 
he chooses, he can maximize the num- 
ber he recalls by “unloading” the last 
few items immediately. Most sub- 
jects in free-recall experiments report 
that these very late items tend to be 
lost if they are not recalled immediately, 
whereas items that came earlier in the 
list can be retrieved at leisure, if they 


TABLE 3 


PROPORTION OF 


LY 


RECALLED AS A FUNCTION OF 
ох TME 


Sri MULUS [5 ENCE AND PRESENTATI 
Number of Seconds per item 
ti i = 
tems S Distributed Massed - 
1 2 3 4 6 2 3 4 
0 .96 99  .97 100  .97 .98 A i” 4 
1 82.90.91 .86 — .89 82 95 бз 
2 -716 81 .86 .82 87 15 58 "58 "80 
3 54.64.73 176.65 a a к "d 
4 38 — 40 „50 7 — .60 0 - po i 
5 21 .36 .36 49 AS 1 E 38 up 
S+ 14 .26 32 38 AS 25 s es ae 


* Entries in this row represent the asymptotic value of R(n). 
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can be recalled at all. In the colorful 
terminology of опе such subject 
(Waugh, 1961), the most recent items 
in a verbal series reside temporarily 
in a kind of "echo box," from which 
they can be effortlessly parroted back. 
When an experienced subject is trying 
to memorize a list of serial items, more- 
over, he “fills up" successive echo 
boxes as the list is read to him and 
attempts to rehearse the contents of 
each. Не will invariably lose some 
items if rehearsal is delayed too long 
or if he attempts to load his echo box 
with more items than it can hold. We 
think it very likely that the PM func- 
tion describes the (variable) capacity 
of this mechanism. We would remind 
you in this connection that, within 
very broad limits, the rate at which 
someone is speaking does not affect 
your ability to follow his words—just 
as differences in the rate at which 
meaningless lists of digits are presented 


do not exert any profound effect on the 
PM function. 


ConcLustons 


We have tried to demonstrate the ex- 
istence of a short-term or PM system 
that is independent of any longer term 
or secondary store by showing that one 
function relating probability of recall 
to number of intervening items can de- 
scribe a number of seemingly disparate 
sets of experimental results. In doing 
so, we have deliberately avoided dis- 
cussing a number of problems raised in 
our analyses. Foremost in our list of 
problems is the definition of an item. 


Nancy C. WaucH AND DONALD A. NORMAN 


Certainly the idea of a discrete verbal 
unit is crucial to our theory. The 1m- 
terference effect that we have studied 
seems to be invariant over a broad 
class of units and combinations of 
units—single digits, nonsense trigrams, 
and meaningful words. How long a 
string of such primitive units can we 
combine and still have one item? UIs 
an item determined by our grammatical 
habits? Is it determined by the dura- 
tion of the verbal stimulus? Is it de- 
termined by both? We do not know. 

We have also avoided discussing the 
possible rules whereby items now in 
PM are displaced by later items. Are 
items lost independently of one another, 
or do they hang and fall together? It 
may perhaps prove difficult to answer 
this question experimentally, but it 
should not be impossible. 

Finally, at what stage in the proc- 
essing of incoming information does 
our PM reside? Is it in the peripheral 
Sensory mechanism? Probably not. 
The work of Sperling (1960) indi- 
cates that “sensory memory”—to use 
Peterson’s (1963) phrase—decay$ 
within a matter of milliseconds» 
whereas we have dealt in our analysis 
with retention intervals on the order 
of seconds. Does storage in PM pre 
cede the attachment of meaning to dis- 
crete verbal stimuli? Must a verbal 
stimulus be transformed into an audi- 
tory image in order to be stored in 
PM, even if it was presented visually ' 
We refer the reader to a recent paper 
by Sperling (1963) for some thoughts 
on the latter question. 


APPENDIX 


A formal discussion of the interaction 
between PM and SM can be provided by 
a simple three-state Markov process, 
The assumptions of the model are: 


1. There are three states of memory: S, 
P, and the null state, G, 


2. The probability of recalling an Нет 
from either State S or State P is unity: 
items cannot be recalled from the n“ 
State, but they may be guessed wit 
Probability g, © 

3. Items сап only pass into State > 
when they are rehearsed and, for the €* 
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periments discussed in this paper, we as- 
sume that items are rehearsed only when 
they are presented. The probability that 
an item is stored in S, given that it was 
successfully rehearsed, is a. 

4. Items in P are interfered with by 
later presentation of different items: the 
probability that an item returns to the 
null state on the presentation of the ith 
interfering item is §;. 


The following equivalents hold between 
the terms defined for the Markov model 
and the terms defined in the body of the 
paper: 


1. P;(S) is equivalent to 5 (1). 
2. P((P) is equivalent to P(i)[1— 
S(i)]. 


3. 8; is equivalent to 1 — P (i). 


Now, define the random variable т 
with Value 1 if the test item is presented, 
and with Value 0 if some other (inter- 
fering) item is presented. (We can also 
let z be a probability—namely, the prob- 
ability that the test item is presented. 
The formal statement of the model does 
not change with this redefinition.) 

The transition probabilities for any 
given stimulus item (the test item) are 
Specified by the matrix 


S P G 


s| 1 0 0 
P|ew (1—т)(1—б6д-+т(1—а@) (1—7)8: 
Glar (1—a)r 1—т 


Unfortunately, it is difficult to work with 
transition matrices of this form (with 
time-varying parameters). One approxi- 
mation would be to let 3; = 8, independent 
of i. This approximation yields an ex- 
ponential decay function of the form 
P\(P) = (1—0) (1— 8). This is 
clearly not correct for the results of our 
experiment (Figure 1); but, for some 
Purposes, it may not be a bad approxima- 
tion. A model very similar mathemati- 
cally to that produced by this simple ap- 
proximation for 8; has been studied by 
Atkinson and Crothers (1964), who 
found it to be quite good for certain types 
Of paired-associates experiments. Their 
model, however, is derived from quite 
different considerations. 
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For any experiments with controlled 
rehearsals, the probability that an item 
reaches State S (or SM) is completely 
independent of the properties of the short- 
term state (P or PM). This is true be- 
cause, as far as State S is concerned, the 
general transition matrix can be reduced 
by combining States P and G to form the 
“lumped” State P’. The new matrix is 


E P 
S [ 1 0 
P'Lar 1—ar]. 
This is a simple one-element Markov 
model. This means that although the 
complete description of the verbal learn- 
ing process requires a description of the 
short-term state, a study of only the long- 


term retention of items can ignore the 
short-term memory. 
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AN IRRITATIVE HYPOTHESIS CONCERNING THE 
HYPOTHALAMIC REGULATION OF FOOD INTAKE 


ROBERT W. REYNOLDS 


University of California, Santa Barbara 


The experimental data relating to hypothalamic hyperphagia are re- 
examined in the light of a discussion of the possible effects of the lesion 
making procedure on the brain tissue. It is shown that, with the 
exception of 1 study, all of the data are compatible with the hypothesis 
that hyperphagia is a result of the chronic irritation of the ventro- 
lateral hypothalamic “feeding center” by scar tissue surrounding 
electrolytic yentromedial hypothalamic lesions. This “irritative hy- 
pothesis” also accounts for the failure of the signs of hyperphagia to 
appear following ventromedial lesions produced with radio-frequency 
current. It is suggested that the general dual excitatory-inhibitory 
theory of the hypothalamic control of motivation and emotion needs 


reexamination. 


One of the most intriguing phenom- 
€na in the repertoire of the physiologi- 
cal psychologist is the syndrome known 
as hypothalamic hyperphagia. It is 
readily obtainable by anyone with rudi- 
mentary skill in stereotaxic surgery. 
Its several signs suggest its relevance 
to the study of the physiological bases 
of motivation and emotion, These 
Signs include postoperative voracity 
and obesity frequently accompanied by 
Viciousness and genital atrophy. No 
attempt will be made here to carry 
through an exhaustive survey of the 
literature concerning this syndrome. 
Excellent surveys are already avail- 
able in reviews by Anand (1961), Bro- 
beck (1960), Ingram (1960), Larsson 
(1954), Mayer (1953), Rosenzweig 
(1962), Stellar (1954), and Teitel- 
ашп (1961). The central experi- 
mental findings may, for the moment, 
© reduced to the following: 


Р а Bilateral electrolytic destruction 

€ region of the ventromedial hy- 
Pothalamic nuclei is followed by a 
eae increase in food intake per- 
and б for periods up to 40 or 50 days 
Бес}; eading to extreme obesity (Bro- 

^ Tepperman, & Long, 1943; 


Hetherington & Ranson, 1940). This 
is known as hypothalamic hyperphagia. 
2. Bilateral electrolytic destruction 
of the ventrolateral hypothalamic region 
is followed by aphagia and adipsia 
leading to death by inanition unless 
the animal is maintained by tube feed- 
ing (Anand & Brobeck, 1951; Anand, 
Dua, & Schoenberg, 1955; Morgane, 
1961b; Teitelbaum, & Epstein, 1962). 

3. Bilateral destruction of both the 
ventromedial and ventrolateral hypo- 
thalamic regions leads to aphagia 
(Anand & Brobeck, 1951). 

4. Electrical stimulation of the ven- 
trolateral region produces immediate 
eating behavior in satiated animals 
(Briigger, 1943 ; Larsson, 1954; Smith, 
1956) as well as extended increases in 
food intake (Delgado & Anand, 1953). 

5. Stimulation of the piece 
hypothalamus produces a cessation О 
etus in food дерей animals (Anand 
& Dua, 1955; Morgane, 1961а; Smith, 
1956). | 

6. Hypothalamic hyperphagic rats 
will not work as hard for food as nor- 
mal rats (Miller, Bailey, & Stevenson, 
1950; Teitelbaum, 1957).  Presum- 
ably, therefore, they are not hyper- 
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motivated, but simply cannot achieve 
satiation due to the destruction of a 
“satiety center.” 

These data have generally led to a 
theory of the hypothalamic regulation 
of food intake in which activity in the 
ventrolateral hypothalamic “feeding 
center” is modulated by the inhibitory 
control of the ventromedial hypothala- 
mic “satiety center." The overeating 
and obesity in hypothalamic hyper- 
phagia are normally explained, there- 
fore, as the result of the animal's inabil- 
ity to satiate itself following removal of 
its "satiety center." The ventromedial 
hypothalamic nuclei, therefore, are pre- 
sumed to function in the regulation of 
food intake as inhibitory centers, re- 
moval of which leads to “release” 
eating. 

This theory is extremely appealing 
in its simplicity and comprehensive- 
ness. [t is possible, however, that it 
is based on an artifact of the lesion 
making procedure. In studies involv- 
ing destruction of brain tissue there is 
generally an implicit, basic assumption 
that the behavioral results must be due 
to the removal of the tissue destroyed 
in the lesioning process. It would 
seem desirable, however, to call even 
this assumption into question, since it 
is quite possible that there may be 
Some residual scar tissue which could 
Serve as a source of irritation to the 
surrounding tissue. Specifically, we 
should ask precisely what the lesioning 
process does to the tissue of the central 
nervous system. The procedure almost 
universally used to produce lesions in 
studies of the physiological basis of 
motivation and emotion is electrolysis, 
Direct current, normally anodal, is 
passed through a monopolar electrode 
which is insulated except at the tip. 
The uninsulated tip is placed in the 
neural structure to be destroyed. The 
circuit is normally completed either by 
a large cathode inserted into the rec- 
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tum, or by connecting the metal frame 
of the stereotaxic instrument to the 
negative side of the power supply. 
The tissue surface in contact with the 
cathode is sufficiently large that the 
current density is very small, and no 
damage is done at that point. The 
current density at the anodal electrode 
in the brain is, conversely, very high 
because of the very small surface in- 
volved. А 
The process that occurs at the tip of 
the anode when current passes de- 
pends on the nature of the surrounding 
medium and the material out of which 
the anode is made. It is probably safe 
to assume for our purposes that the 
brain tissue acts essentially as a saline 
bath, i.e., it is a reasonably good elec- 
trolyte. Given this, the type of reac- 
tion that will occur at the anode de- 
pends on the electrode potential of the 
electrode material Metals with a” 
electrode potential less positive than 
hydroxyl ion (ie., less than +.40 for 
half cell reduction reaction) will .go 
into solution as the corresponding 
metallic ions. Thus, anodes made 0 
copper, nickel, iron, and chromium 
will deposit metallic ions in brain tis- 
Sue. According to the investigations O 
MacIntyre, Bidder, and Вомар 
(1959), and Rowland, MacIntyre, 2? 
Bidder (1960), the destruction | o 
brain tissues with such electrodes i5 4 
function of the invasion of the tissue 
by these metallic ions, In addition t° 
these metals, platinum is frequent y 
used as the electrode material for ma» 
ing lesions. However, since platinum 
has a more positive electrode potentia 
than hydroxyl ion (+.863), the ВУ" 
droxyl ion reaction is favored. There 
fore, cytolysis in this case is not cause " 
by the depositing of platinum ione; у 
the tissue, but, rather, is presumal á 
the mechanical result of the generatio 
of oxygen gas bubbles at the electro 
tip. The lesions produced when t 
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current is reversed (i.e., the electrode 
15 made the cathode) are negligible 
with steel, copper, or nickel electrodes 
(Rowland et al., 1960). However, a 
platinum electrode is apparently even 
more effective (Horsley & Clarke, 
1908) when used as the cathode, pre- 
sumably because for a given amount of 
current, the volume of gas generated is 
twice as much at the cathode (hydro- 
gen) as at the anode (oxygen). It is 
clear that either the introduction of 
metallic ions or mechanical deformation 
by the generation of gas bubbles will 
destroy the cells around the tip of the 
electrode. This is not all that hap- 
pens, however. The introduction of 
metallic ions in nerve tissue evokes a 
glial reaction and generally results in 
the formation of a scar. When the 


tissue at the electrode tip is highly 
vascularized, as is the case in the 
hypothalamus, there may be rupture of 
blood vessels followed by hemorrhage 


Fic. 1, Photomicrograph of coronal section through gr 
€sion in ventromedial region of rat hypothalamus. (The o 


his lesion was diabetes insipidus.) 
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and the formation of scar tissue. The 
formation of hemorrhagic scar tissue 
is even more likely following the 
mechanical destruction of highly vascu- 
larized tissue by a platinum anode. It 
is highly probable, therefore, that in 
addition to destroying nerve cells, elec- 
trolytic lesions will leave some scar 
tissue around the area of destruction. 
It is a well-known clinical observation 
(Walker, 1949) that scar tissue may 
serve as an irritative focus for the 
chronic production of epileptic seizures 
or the signs of basal ganglia disease 
(Glees, 1961, p. 308). It would seem 
reasonable, then, to hypothesize that 
electrolytic lesions, particularly in 
highly vascularized regions, may have 
chronic irritative effects on surround- 
ing tissue. It is a hypothesis, how- 
ever, that deserves direct experimental 
verification. 

The preceding analysis was necessi- 
tated by the observation (Reynolds, 
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1963c) that destruction of the ventro- 
medial hypothalamic nuclei by radio- 
frequency thermocoagulation did not 
produce hyperphagia. The radio-fre- 
quency lesions in one of the animals in 
that study are shown in Figure 1. 
"These lesions completely destroyed the 
medial part of the hypothalamus in- 
cluding the ventromedial nuclei with- 
out producing the chronic signs of 
hyperphagia following similarly placed 
electrolytic lesions. The lesions were 
bounded laterally by the fornices and 
extended from the plane of the 
posterior end of the paraventricular 
nucleus to the anterior end of the 
mamillary bodies. It should be noted 
that this was not simply ventricular 
distention, since there was no evidence 
of residual ventricular ependyma 
around the lesion or tissue compression 
lateral to the lesion. This original 
study was subsequently confirmed in 
a replication involving a direct com- 
parison of 55 rats with electrolytic 
lesions and 40 rats with radio-fre- 
quency lesions of comparable size in 
the ventromedial hypothalamus. Aver- 
ill and Purves (1963) also failed to 
produce hyperphagia with radio-fre- 
quency lesions in the hypothal 
although they attribute this fail 
the small size of their lesions. 
Radio-frequency lesions are pro- 
duced by a Surgical procedure identi- 
cal to that used for electrolytic lesions 
except that the current passed through 
the electrode is a biphasic sine-wave 
current at a frequency of 2 megacycles 
per second. Because of the biphasic 
character of this current, there is no 
residual deposit of metallic ions in the 
tissue. Cytolysis is produced through 
heat at the electrode tip (Aronow, 
1960). In addition to destroying 
nerve cells, the heat can cauterize small 
blood vessels in the region, and thus 
tend to minimize the incidence of 
hemorrhage. The formation of scar 
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tissue, therefore, should be relatively 
infrequent in radio-frequency lesions. 
The microscopic appearance of the 
lesions in general confirms this. si 
Since removal of the ventromedia 
hypothalamic nuclei by radio-frequency 
lesions does not produce a chronic 
change in food intake, the hypothalamic 
hyperphagia resulting from electrolytic 
destruction of the ventromedial mer 
cannot simply be attributed to remova 
of a ventromedial hypothalamic “‘satiety 
center.” We must, therefore, find an 
alternative explanation, The position 
outlined above, that scar tissue P. 
duced by electrolytic lesions may hav 
an irritative effect on the surrounding 
tissue, may be adopted as a geris 
hypothesis. Hyperphagia may аген 
therefore, from the chronic irritatlo 
of the adjacent ventrolateral “еей 
center." This hypothesis, however, T 
dependent on the theory that the ү 
trolateral hypothalamus is indeed ? 
"feeding center." Since the wc 
center" theory is based, in large ea 
on data from electrolytic lesion pt 
it seemed necessary to ascertain w 
effect of radio-frequency destruction 
the ventrolateral hypothalamus AS d 
first step in exploring the "irritatiV 
hypothesis. No difference was fou” 
in the effectiveness of radio-frequency 
and electrolytic lesions in producins 
aphagia (Reynolds, 1963b). sent 
It should be noted that transie 
irritative effects may be produced ich 
massive tadio-frequency lesions w tive 
have no long-term, chronic your Ай 
effects simply as a function of surgi is 
trauma and local tissue edema. ге 
the chronic effects, however, ЖЫЕП Si 
of importance in the present enamel 
It should also be noted that the Т, 
of radio-frequency current does 
guarantee that the lesions will be 
of scar tissue or irritative effects- о 
simply minimizes the probability |, 
their Occurrence. Conversely, the 
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of electrolysis does not guarantee that 
the lesions will have irritative effects, 
but rather it maximizes the probabil- 
ity thereof. This would presumably 
account for the fact that on occasion 
electrolytic destruction of the ventro- 
medial hypothalamus also fails to pro- 
duce hyperphagia. 

At the beginning of this paper there 
were listed six main points regarding 
the hypothalamic regulation of food 
intake. The first point, that electro- 
lytic ventromedial lesions produce hy- 
perphagia, has been accounted for 
already by the "irritative" hypothesis 
itself. It is also instructive to note 
that Kennedy (1950) finds 


that the most effective lesions lie ventro- 
lateral to the ventromedial nuclei, and often 
appear to cause little direct damage to the 
nuclei themselves. 


Furthermore, the first implication of 
the tuberal hypothalamic region in the 
control of food intake was quite likely 
the description of Fróhlich's Syndrome. 
One of the signs of this syndrome is an 
extreme obesity. The underlying path- 
ology is an expanding tumor in the 
infundibular region. It does not seem 
unreasonable to assume that such tu- 
mors could have irritative effects. 
Points 2, 3, and 4, concerning the 
ventrolateral “feeding” center, are 
compatible with the “irritative’ hy- 
Pothesis—indeed, they would be pre- 
dicted by it. But what of the remain- 
Ing two points? 

On a priori grounds, it may be 
argued that Point 5—that stimulation 
of the ventromedial hypothalamus pro- 
duces a cessation of eating in hungry 
animals—has meaning only in the con- 
text of the “satiety center” theory. 
Indeed, Krasne (1962) has recently 
Shown that stimulation of the ventro- 
medial hypothalamus at an intensity 
Capable of producing a cessation of eat- 
mg would also prevent drinking and 


could be used as the motivational 
stimulus to produce escape behavior. 
Krasne concludes, therefore, 


the stopping of eating caused by stimulation 
of the ventromedial hypothalamus and ad- 
jacent areas may not represent a primary 
effect on hunger, 


but may instead be a result of a general- 
ized “upsetting or distracting” effect 
of the stimulation. 

The studies purporting to show that 
hyperphagics are not overmotivated 
cited in Point 6 should be reexamined. 
In the Miller, Bailey, and Stevenson 
(1950) study animals with hypothala- 
mic lesions were shown to work less 
for food than normals on several dif- 
ferent tasks. The significance of this 
study must be discounted, however, 
since there is no convincing evidence 
presented that any of the animals were 
in fact hyperphagic. The authors 
state, 

Placed on a diet of dry, ground lab chow, 
four of the operated animals (to be called 
Operated High-Eaters) ate 30%-85% more 
than the average of the controls; seven 


(Operater Low-Eaters) are approximately 
the same or less than the controls [p. 257]. 


It is quite likely, however, that 5 
of the 10 control animals also ate 
more than the average of the controls, 
but in the absence of any information 
about the range of food intake in the 
control group there is no way to de- 
termine the degree to which there was 
any overlap with the “Operated High- 
Eaters.” The authors do not indicate 
that there was a statistically signifi- 
cant difference between the Operated 
High-Eaters and the controls. In 
view of their reference to a statistically 
significant difference with respect to 
another comparison in the next 
sentence, one is left with the impres- 
sion that there was not. A superficial 
reading, however, might give the op- 
posite impression. Furthermore, a 
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post facto classification into groups 
(Operated High-Eaters and Operated 
Low-Eaters) on the basis of the de- 
pendent variable is hardly a legitimate 
procedure. Perhaps more important, 
however, are the weight data as given 
in Figure 1. It appears that the Oper- 
ated High-Eaters had ап average 
weight of about 370 grams at the time 
of operation, while the controls aver- 
aged about 345 grams. Thus the Op- 
erated High-Eaters started off weigh- 
ing about 25 grams more than the 
controls. Forty days after the opera- 
tions the average weight of the Oper- 
ated High-Eaters was about 425 grams 
and the average weight of the controls 
was about 400 grams, Again a dif- 
ference of about 25 grams, Thus, the 
controls gained weight at almost ex- 
actly the same rate as the alleged 
hyperphagics. This rate was about 
1.5 gram per day, which certainly can- 
not be considered hyperphagia. It is 
true that when they were put on a 
high fat diet both operated groups 
gained weight faster than the con- 
trols. But this means that their 
"hyperphagia" appeared only with a 
high fat diet, which is contrary to nor- 
mal experience with hyperphagia. 
This study is further weakened by a 
footnote in a subsequent paper by Mil- 
ler (1955) which states that, “under 
somewhat different conditions, which 
are not yet well understood by us, the 
rats with hypothalamic lesions can not 
only eat more but also work harder 
for food [p. 141]." 

Teitelbaum's (1957) paper also con- 
tains some methodological problems, 
He compared bar-press rates for pellets 
with dynamic hyperphagics, obese hy- 
perphagics, and normal controls on 
fixed ratio schedules ranging from 1 to 
256. Animals were given access to the 
bar for 12 hours each day, with no 
food available for the other 12 hours, 
After 2 or 3 days of stabilization on 


continuous reinforcement, the rein- 
forcement ratio was increased through 
the Steps 1, 4, 16, 64, and 256 with ; 
days at each step. For the first 
steps, the dynamic hyperphagics e 
tually gave a higher average number 0 
responses than the normal animals. At 
ratios of 64 and 256:1 the ioc nd 
output of the hyperphagics dropped off, 
while that of the normals continued to 
increase. It should be noted, however, 
that even the normals gave only a mean 
of about 4,500 responses per 12-hour 
period at the 256:1 ratio. А well- 
trained animal will approximate that 
many responses in 1 hour at such а 
high ratio. It is apparent, therefore, 
that while this procedure does no 
exactly constitute extinction, it also 
does not represent “typical” perform- 
ance at these ratios, The exact sense 
in which this is a fair test of relative 
motivation is certainly not clear. There 
is the further confounding, noted by 
Teitelbaum himself, that the hyper- 
phagics are very low-activity anima 5, 
and do not show the increase in activity 
as a function of food deprivation whic 
is characteristic of normals, Hamilton 
(1963) has suggested that the failur 
of the hyperphagic to increase its — 
rate at the higher ratios may be due » 
an increased tendency to hyperthermi? 
Which is aggravated at the higher ас 
tivity levels required by high ratios, 
More recently, Hamilton and ane 
(1964) replicated some aspects of t » 
Teitelbaum (1957) апа Нато 
(1963) studies using hyperphag 
monkeys, They showed that А 
the hyperphagic monkey works for mO 
food than controls at all FR schedules UPa 
ratios where extinction in all Ss makes 
difference meaningless [p. 278]. ый 
Falk (1961) has shown a very ees 
curvilinear correlation (.84) bett ie 
increase in response rate on a bae g 
interval schedule (VI 1) cc Я 
ventromedial lesions and weight 
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tained after surgery and the post- 
surgery testing. This would suggest 
that the hyperphagics, at least in the 
initial stages, will in fact work harder 
than normals. Teitelbaum (1961) has 
commented that during the postsurgical 
testing in Falk’s experiment the ani- 
mals were maintained in a deprived or 
semistarved condition and that this 
may have introduced a new factor. It 
should be pointed out, however, that 
the obese hyperphagics in Teitelbaum’s 
Own experiment apparently received on 
the average only 1 gram of food alto- 
gether over the last 4 days of testing, 
and the dynamics about 5 grams. Con- 
sidering that the dynamics weighed 
only 20 grams more than the controls 
at the start of the testing it is quite 
likely that they would have lost the 70 
grams necessary to reduce their weights 
to 80% of the control weight during 
these 4 days of relative deprivation. 
Finally, Balinska (1963) has shown 
an increase over normal in the per- 
formance of a conditioned-reflex Type 
II in dynamic hyperphagic rabbits. In 
any case, it is probably fair to conclude 
that the notion that hyperphagics are 
not overmotivated, but rather are 
Simply insatiable, is not very firmly 
established. 

There are other studies bearing on 
the role of ventromedial hypothalamus 
in hunger motivation that should be 
mentioned. Anand, Dua, and Singh 
(1961), and Anand, Chhina, and Singh 
(1962) have studied the effect of sys- 
temic glucose and insulin injections on 
activity recorded from the ventro- 
medial and ventrolateral hypothalamic 
Tegions through both macro- and micro- 
electrodes. They report an increase in 
b d in the ventromedial region 
owing glucose and a decrease fol- 
Owing insulin injections. The effects 
ke the ventrolateral region were the 
jm These results support 

ayers (1953) “glucostatic” theory 
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in which the ventromedial hypothala- 
mic “satiety center" is activated by 
high blood glucose levels or rates of 
glucose utilization. Sutin (1963), 
however, reports no effect of glucose 
injections on potentials evoked in the 
ventromedial nucleus by stimulation of 
the amygdala and septal region. Al- 
though both Sutin’s and Anand’s groups 
used both anesthetized and unanesthe- 
tized animals, the resolution of these 
conflicting results is made difficult by 
Brooks’ (1959) observations on the 
very marked influence of depth of anes- 
thesia on activity in the ventromedial 
hypothalamus. At best, we can only 
say that the electrophysiological evi- 
dence is equivocal. Recently this point 
has been subjected to behavioral exami- 
nation. If the ventromedial nucleus 
were indeed the “glucostat,” glucose 
injections should not depress eating in 
animals with ventromedial lesions. 
Russek and Morgane (1963), however, 
found glucose suppression of food in- 
take in two cats made hyperphagic by 
electrolytic lesions in the ventromedial 
hypothalamus to be equivalent to that 
found in normal control animals. Rey- 
nolds and Kimm, moreover, found 
an exaggerated anorexic effect of glu- 
cose injections in hypothalamic hy- 
perphagic rats in comparison to nor- 
mal controls. This is reminiscent of 
the similar “paradoxical” results ob- 
tained by Epstein (1959), Stowe and 
Miller (1957), and Reynolds (1959) 
on the effect of amphetamine on food 
intake in hyperphagic rats. These 
studies tested the hypothesis that am- 
s its anorexic effect by 
stimulating the ventromedial hypo- 
thalamic “satiety center.” It was 
found, however, that there was an even 
greater effect in the absence of the 
supposed target. These results still 
have not been satisfactorily explained. 

Hoebel and Teitelbaum (1962) have 
combined the techniques of electrolytic 


phetamine ha 
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ablation, intrahypothalamic chemical 
injection, and intracranial self-stimula- 
tion in an effort to elucidate the moti- 
vational roles of the ventromedial and 
ventrolateral hypothalamus. Their re- 
sults generally seem to support the 
theory that the ventromedial hypo- 
thalamus normally has an inhibitory 
effect on the ventrolateral hypothala- 
mus. This appears to be a fairly 
elaborate and comprehensive study. 
However, the conclusions drawn by 
the authors themselves are quite shat- 
tering. Their last two sentences state, 
When an animal is hungry, lateral hypo- 
thalamic self-stimulation is more reinforcing ; 
when satiated, it is less so. It may be that 
the pleasure of lateral hypothalamic self- 


stimulation is similar to the gratification ob- 
tained by eating [p. 376]. 


We may ignore the hedonic overtones. 
The essential point is that stimulation 
of the ventrolateral hypothalamus is 
positively reinforcing. It is more re- 
inforcing when the animal is hungry. 
This, however, is the “feeding center,” 
activation of which leads to eating. 
Under normal physiological conditions, 
high levels of activity in the “feeding 
center” are supposed to correspond to 
a condition of hunger. If ventrolateral 
self-stimulation has, as the authors sug- 
gest, any relationship to normal physio- 
logical activation, the feeding-center 
theory should lead us to believe that 
such self-stimulation must be related to 
the induction of hunger rather than the 
“gratification obtained by eating.” 
This in turn leads us to the perplexing 
conclusion that the induction of hunger 
is positively reinforcing, or that ani- 
mals should learn responses which re- 
sult in their becoming hungry. This 
may or may not be true, but it certainly 
a novel inversion of most current 
theories of hunger motivation. What- 
ever the implications of this study for 
the "irritative hvpothesis," it would be 
potentially quite devastating for the 
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feeding-center theory. The impact of 
this study is further weakened, however, 
by Morgane’s (1962) demonstration of 
the dissociation of the self-stimulation 
and “feeding” systems. Medial fore- 
brain bundle lesions anterior and 
posterior to the ventrolateral hypo- 
thalamic feeding center “significantly 
suppressed" the self-stimulation re- 
sponse in the ventrolateral hypothala- 
mus without affecting the feeding re- 
sponse elicited by stimulation through 
the same electrode. 

Using bilateral cannulae implanted 
in the ventromedial or ventrolatera 
hypothalamus, Epstein (1960) | a 
served the effects on eating behavior 0 
local injections of procaine and hyper- 
tonic saline. His findings were in line 
with the “satiety-feeding center 
theory, i.e., ventromedial injections 0 
saline depressed food intake in de 
prived rats while procaine injections 
induced eating in satiated animals 
The effects were reversed for ventro- 
lateral injections. All of these results 
can be comprehended by the irritative 
hypothesis except for the effect of pro^ 
caine on the ventromedial hypothala- 
mus. It is assumed that procaine has 
an inhibitory effect on hypothalamic 
cells. Тһе ventromedial injections: 
therefore, supposedly represent a teur 
porary ablation of the “satiety center 
Epstein's behavioral data are impr, 
sive, and are clearly incompatible wit 
the irritative hypothesis. Howeve! 
they are also at odds with my observa- 
tion that radio-frequency ablation x 
the ventromedial hypothalamus dacs 
not produce hyperphagia, It is а di 
crepancy that deserves empirical r€59 
lution. i 

In a study just completed, electrolyte 
lesions were placed in rats using ange 
taxic coordinates appropriate Er у> 
region of the Fields of Forel, just ie 
sal to the ventrolateral hypothalar?- 
region. Although there were no та] 
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long-term changes in body weight with 
the exception of three cases of aphagia, 
there was an extremely significant in- 
crease in the chewing activity of these 
animals. To obtain a rough estimate 
of this chewing activity, periodic meas- 
ures were taken of the amount of food 
crumbs accumulating under the cages 
over 24-hour periods. The animals 
with the electrolytic lesions averaged 
over 20 grams of food crumbs per 24- 
hour period in comparison with 5 
grams for the normal controls. It 
Should be noted that these animals were 
not ingesting increased quantities of 
food, since they were not gaining ap- 
preciably in body weight. It could be 
argued that the Fields of Forel are 
inhibitory centers which normally func- 
tion to suppress chewing behavior. It 
Seems much more likely, however, to 
Suppose that these lesions were having 
a chronic irritative effect on the "far 
lateral" part of the feeding center, 
which Morgane (1961a) believes to 
control the motor and reflexive aspects 
of eating behavior. The fact that hy- 
perphagia did not appear in these ani- 
mals may be attributed to the fact that 
the lesions were not close enough to 
the motivational component, which, ac- 
cording to Morgane, lies more medially 
in the medial forebrain bundle, just 
lateral to the ventromedial region. 
Any attempt to approach the motiva- 
tional component more closely from a 
lateral or dorsal direction would in- 
volve destruction of the motor control 
Component, thus preventing the ap- 
Pearance of hyperphagia because of the 
Inability of the animals to chew. In- 
deed, Williams and Teitelbaum (1959) 
found that although animals with such 
sions were aphagic when maintained 
Оп а diet of dry pellets and water, they 
ae hyperphagic when placed on a 
a diet which presumably did not 

luire chewing. A reasonable ex- 
Planation for these data in terms of the 
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“release phenomenon” type of theory is 
not readily apparent. They can be ac- 
counted for without difficulty, however, 
in the context of the “irritative” 
hypothesis. 

The irritative hypothesis obviously 
does not provide an alternative theory 
concerning the hypothalamic regulation 
of food intake. It does suggest an al- 
ternative explanation to the satiety- 
center theory for the experimental 
phenomenon of hypothalamic hyper- 
phagia. At the very least, the above 
discussion indicates the need for a re- 
examination of the problem of the 
hypothalamic control of food intake. 
Analogously, we may question the gen- 
eral excitatory-inhibitory dual control 
theory (Stellar, 1954, 1960) of motiva- 
tion. It may be that the “release” 
phenomena so widely observed follow- 
ing electrolytic destruction of postu- 
lated “inhibitory” centers are the re- 
sult of irritation rather than ablation. 
Consistent with this position is the 
finding (Reynolds, 1963a) that, as in 
hypothalamic hyperphagia, the signs of 
pulmonary edema fail to appear when 
the lesions are made by radio-frequency 
thermocoagulation. This syndrome 
has been previously attributed to the 
release of an edemagenic center from 
inhibition following electrolytic lesions 
(Maire & Patton, 1956). Law and 
Meagher (1958) have demonstrated 
the elimination of sexual behavior fol- 
lowing medial hypothalamic lesions 
while hypersexuality followed electro- 
lytic lesions either anterior or posterior 
to this region. The postulation of two 
inhibitory centers, one anterior and one 
posterior to the excitatory center, 
would seem less parsimonious in the 
absence of other evidence than the sug- 
that the anterior and posterior 
lesions are irritating the excitatory 
center which lies between them. In- 
deed, Hillarp, Olivecrona, and Silfer- 
skidld (1954) have suggested that the 


gestion 
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hypersexuality they observed immedi- 
ately following preoptic lesions may be 
an irritative rather than a release ef- 
fect. Everett (1961) has similarly re- 
ported that the production of ovulation 
in rats under pentobarbital anesthesia 
was consistently obtained with small 
electrolytic lesions in the preoptic 
region, while "control lesions of com- 
parable dimensions produced by high- 
frequency electrocautery have been 
ineffective [p. 111].” Everett sug- 
gested that the positive effects obtained 
with the electrolytic lesions were the 
result of irritation. It is instructive to 
note that Nakao (1958), Glusman and 
Roizin (1960), and Wasman and 
Flynn (1962) find that stimulation of 
regions around the ventromedial hypo- 
thalamus elicits vicious behavior very 
similar to that which frequently ap- 
pears chronically following ventro- 
medial electrolytic lesions (Ingram, 
Knot, Wheatley, & Summers, 1951; 
Wheatley, 1944). The proximity of 
the points in the hypothalamus, stimu- 
lation of which yields gastric ulceration 
(Feldman, Behar, & Birnbaum, 1961; 
Long, Leonard, Story, & French, 
1962), to the regions in which electro- 
lytic lesions produce the same effect 


(Long, Leonard, Chou, & French, 
1962) should also be grounds for 
suspicion. 


The hypothalamic regulation of mo- 
tivation and emotion has been an im- 
portant integrating formulation for 
physiological psychology. The above 
discussion however, demonstrates the 
necessity for a reevaluation, If the 
theoretical structure rests on an arti- 
fact we should be prepared to salvage 
what we can, but also to begin work 
on a new and sounder edifice, 
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ROTARY MOTION IN DEPTH: 
AN INVESTIGATION AND A GENERAL THEORY 
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3 explanations of apparent reversals (oscillation) of rotary motion 
in depth attribute this effect to misjudgment of orientation. These 
explanations are based mainly on observations of a trapezoidal 
“window” in rotation. The experiments reported here show that 
perspective effects in a trapezoidal window do not increase reversal 
frequencies and that other shapes in addition to a trapezoid exhibit the 
effect with similar frequencies. The experiments also failed to confirm 
that misjudgments of orientation are a causal condition of apparent 
reversals. A general theory in terms of an identity of projected 
(retinal) motion characteristics for clockwise and anticlockwise motion 
is proposed with supporting evidence. Apparent orientation is held to 
be a consequence of rather than a necessary condition for apparent 
reversal. This theory is sufficiently general to explain apparent re- 


versals (“fluctu: 


and objects and to explain also the kinetic depth effect. 


ations") in the orientation in depth of static figures 


All these 


phenomena are held to derive from an identity of retinal projections for 
2 or more motions or orientations of an object in space. 


Objects rotating in depth relative to 
an observer are frequently judged, 
usually with monocular vision, as ap- 
parently reversing their directions of 
rotary motion. Recession of a point or 
edge is sometimes reported as approach 
and vice versa. This effect, as far as is 
known, was first reported by Kenyon 
(1898) in the following terms: 


A curious illusion connected with an ordi- 
пагу two-winged pendant fan . . . consists 
in the fan appearing to rotate in the opposite 
direction. . . . Two other illusions . . . may 
be noted. In one the vanes, instead of rotat- 
ing, seem to flap together; in the other the 
two arms appear to be continually withdraw- 
ing into and pushing out from the hanging 
rod [p. 371]. 


The effect was later mentioned by 
Miles (1929) and, more recently, ex- 
tensively investigated by Ames (1951) 
i aei explained it in terms of assump- 
1015 consequent upon past experience 


1 
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with certain shapes. Alternative ex- 
planations have since been proposed by 
Pastore (1952) and Graham (1963). 
All three explanations are based largely 
on observations of a rotating isosceles 
trapezoid ? cut and painted to resemble 
a mullioned window in perspective. 
Quantitative data from a series of re- 
cent experiments (Day & Power, 
1963), however, failed to support any 
of these explanations and a more gen- 
eral theory is presented here with sup- 
porting evidence. 

It is intended to review j 
explanations, to present experimen 


per (Day & Power, 1963) 
was used instead of trape- 
y is clarified by 


briefly earlier 
tal 


? [n an earlier рај 
the term trapezium 


zoid. This inconsistency 1 ) 
reference to the McGraw-Hill Encyclopedia 


of Science and Technology, Vol. 14, 1960: 
“Trapezoid: a term used in the United States 
for a quadrilateral with two sides parallel. 
In Great Britain such a figure is often 
called a trapezium, a term used in the United 


States for a general quadrilateral.” 
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data which render them questionable, 
and then to describe the more general 
theory. It will also be shown that the 
present theory can be extended to ex- 
plain a variety of apparent reversing 
phenomena associated with stationary 
and moving "ambiguous" figures of 
two and three dimensions. 


Current EXPLANATIONS OF 
APPARENT REVERSAL 


Ames’ Explanation 


The explanation proposed by Ames 
(1951) derives from qualitative data 
and rests upon two basic assumptions. 
First, an isosceles trapezoid cut and 
painted to resemble a mullioned window 
in perspective is seen as slanted to the 
frontoparallel plane when, in fact, it 
lies in that plane. The longer vertical 
edge is seen as closer than the shorter 
irrespective of the degree of slant, 
Second, a cue for rotary motion in 
depth is held to be a continuous varia- 
tion in the total horizontal angle sub- 
tended at the eye. Thus, when the 
window, actually in the frontoparallel 
plane, but seen as slanted to it, begins 
to rotate, an observer would expect the 
total horizontal angle to increase as 
the apparently further edge approaches. 
Since, however, the horizontal subtense 
decreases, and, since this is a cue for 
recession, the window's direction. of 
motion is seen as Opposite to its true 
direction. Ames (1951) argued that 
the windowlike properties of the shape 
enhanced the effect since a window is 
commonly rectangular. A trapezoidal 
shape, therefore, would tend to be seen 
as a slanted window, Presumably a 
door or book-cover pattern or some 
such would serve as well to emphasize 
the reversal effect, 


Pastore's Explanation 


Pastore (1952) demonstrated and 
Canestrari ( 1956) and Mulholland 
(1956, 1958) confirmed that reversals 
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of rotary motion were not confined to 
trapezoidal shapes. It was found "i 
the former that the effect occurred with 
a variety of forms including a circle, м 
ellipse, а “lopsided” elipse, a triangle, 
and a solid object. Pastore’s explana- 
tion is based on a difference ваа 
true and apparent slant. When iem 
is a discrepancy between these, ap 
parent direction of motion is opposite 
to true direction. Contrariwise, ype 
apparent slant conforms to true sies 
perceived motion direction is the sam 
as true direction, This principle bee 
demonstrated by slanting a ігаре201 E 
window at 45 degrees from the a o” 
parallel with the longer vertical eds: 
furthest, obtaining a judgment of me , 
and then rotating it through about | a 
degrees to obtain a judgment of are 
direction. Kilpatrick (1953) Lar 
pointed out that there is little to r^ 
between the explanation. proposed à 
Ames and that put forward by Pastor 
Both stress the role of apparent slat 5 
in determining apparent reversals ! 
rotary motion. 


Graham’s Explanation 


Graham (1963) has explained ity 
Parent reversals in terms of an idee 
of differential angular velocities or 
Points on the surface of the object 
clockwise and anticlockwise npa m 
and the resolution of direction by P 
Spective cues, Tt is argued that in. 
ambiguity of movement parallax e 
ing from identical differential G a 
velocities is resolved by the use of p e 
Spective cues provided by the ge 
zoidal window. Thus, the short V n 
tical edge, no matter in which quad, 
of the circle it lies, is judged as уеп 
in one of the two far quadrants. wW f 
the short edge actually lies in ee is 
these quadrants, movement directio i 
correctly judged, but when it ue 
one of the two near quadrants, ure rit 
rotation seems to occur, The simila 
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between this explanation and that pro- 
posed by Pastore (1952) is recognized 
by its author. 


TESTS oF CURRENT EXPLANATIONS 


, Although the three explanations re- 
viewed above derive írom different 
theoretical positions, they have two fea- 
tures in common. First, they are based 
almost entirely on data obtained from 
Observations of a rotating trapezoidal 
Shape. Pastore (1952) demonstrated 
reversals with a variety of shapes, but 
he argues from observations of a 
trapezoidal window slanted at 45 de- 
grees from the frontoparallel. Second, 
all three explanations treat misjudg- 
ment of slant as a critical determinant 
of apparent reversal. That is, mis- 
judgment of the frontal orientation of 
the object is regarded as a necessary 
condition for the oscillatory effect. 
The greater apparent distance of the 
Short vertical edge of the trapezoid 
regardless of its true orientation, and 
the enhancement of this effect by win- 
dow characteristics, is regarded as the 
basis of the apparent reversal phenome- 
non. In the following experiments, 
therefore, these stimulus properties 
have been examined. 

Throughout what follows the term 
apparent reversal will be used to refer 
to the subject's report of the approach 
of an edge or part of the object when 
in fact it is receding and vice versa. 
The alternation of apparent reversals 
With reports of true motion direction 
15 referred to as oscillation. The term 
(apparent) orientation will be used 
throughout to refer to the angle that 
the Shape makes with the subject's 
frontoparallel plane when it is mounted 
9n a vertical axis. 


Experiment 1: Effects of Stimulus 
habe and Pattern 


ae a series of recent experiments 
ay & Power, 1963), six stimulus 


conditions were derived from two 
shapes (isosceles trapezoid and rec- 
tangle) and three surface patterns 
(plain white, vertical black and white 
bars, and an Ames window). The 
vertical edges of the trapezoids were 
11.75 and 6.25 inches, and their non- 
parallel edges 9.625 inches. The rec- 
tangular shapes were 11.625 x 9.25 
inches. The trapezoidal window was 
mullioned and painted to resemble a 
window with thickness slanted to the 
line of regard. The objects, which 
were mounted in an enclosed chamber, 
rotated clockwise at 8 rpm and were 
viewed monocularly from 52 inches 
through a viewing tube. Six inde- 
pendent groups of 12 subjects drawn 
from undergraduate classes in psychol- 
ogy signaled apparent reversals in the 
direction of rotation by operating a 
press-switch during two 20-revolution 
trials. Reversals were recorded on an 
automatic counter. 

Mean frequencies of reversals for 
each of the six shapes and patterns 
based on two trials are shown in Fig- 
ure la. Although there were large and 
significant differences between reversal 
frequencies for the two shapes (р < 
.001), there was none between patterns 
(p>.10). The surface character- 
istics of a mullioned window in per- 
spective failed to increase significantly 
the frequency of apparent reversals, a 
finding contrary to the generally held 
view that resemblance to a commonly 
rectangular object contributes to the 
effect. Identical results were obtained 
in a repetition of this experiment using 
subjects quite unfamiliar with the ef- 
fect. In the case of the rectangular 
shapes only a small proportion of sub- 
jects in each group reported apparent 
reversals, whereas all subjects reported 
reversals with the trapezoidal shapes. 

' In a second experiment under the 
same viewing and responding condi- 
tions three shapes (circular, elliptical, 
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Fic. 1. Part a: Frequencies of apparent 
reversal of rotary motion for two shapes 
(trapezoidal and rectangular) and three 
surface patterns (plain white, vertical bars, 
and "window"), Part b: Frequencies of re- 
versal of rotary motion for a circular, ellip- 
tical, and irregular shape. (In this and sub- 
sequent figures the plain white shapes are 
shown in black for convenience, ) 


and irregular) with a vertical length of 
10 inches were observed during two 
20-revolution clockwise trials by inde- 
pendent groups of 15 subjects. The 
histograms in Figure lb show that 
whereas the elliptical and irregular 
shapes exhibited similar frequencies of 
reversal (p > .05), the circular shape 
did so less frequently. The apparent 
reversal frequency of the latter shape 
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differed significantly from that of both 
ellipse and irregular shape (р < .01). 


Experiment 2: Apparent Orientation 


Since there are no quantitative data 
on the apparent orientation of a trape- 
zoidal window and other shapes when 
stationary, Experiment 2 (Power, 
1964) was designed to obtain such 
data for two orientations of six shapes. 
Using the same monocular viewing 
conditions as in Experiment 1, six 
groups of 10 subjects matched the 
orientation of a trapezoidal window, aa 
ellipse, a rectangular window, an 
three irregular shapes while they were 
Stationary. One group was assigned = 
one shape and was required to match 
its orientation when it was slanted a 
60 and 120 degrees from the sagitta 

0 degree) plane. The shapes were 
suspended from the top of the viewing 
chamber, and the subject matched their 
orientation by means of a rectangular 
shape with a lattice pattern on its sur- 
face. "The axes of the standard an 
comparison shapes were coextensive, 
and Preliminary observations һа 
Shown that the comparison rectangle 
could be accurately oriented into the 
frontoparallel plane, Adjustment 0 
the slant of the lower comparison rec 
tangle was made by means of a han 


Fic. 2. Apparent orientation 
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(dotted lines) of four sh 


sa 


3 indow: 
apes (trapezoidal wind 


у от 
rectangular window, ellipse, and irregular) at two angles (60 and 120 degrees) fri 


sagittal (0 degree) plane. 
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wheel in front of the subject which was 
connected by means of a pulley and 
belt to the axis of the shape. The task 
assigned the subject involved adjust- 
ing the slant of the lower comparison 
shape so that it lay in the same appar- 
ent plane as the upper standard shape. 
_ The mean angular matches for the 
six shapes at 60 and 120 degrees from 
the sagittal plane are shown in Figure 
2 as dotted lines. The 60- and 120-de- 
gree true orientations are shown as 
heavy continuous lines, and the shapes 
themselves are shown accompanying 
each diagram. While the ellipse, the 
Tectangular window, and the three ir- 
regular shapes were judged as oriented 
in the correct quadrants at both angles, 
the trapezoidal window was judged as 
being oriented with its shorter vertical 
edge in one of the two far quadrants 
Tegardless of its true orientation. 
Whereas this judgment was correct for 
the 60-degree position, it was incorrect 
for the 120-degree position. 

Thus the trapezoidal window, whose 
mean apparent reversal frequency in 
Experiment 1 was 21, was misjudged 
as to its quadrants of orientation in one 
of its two positions. On the other 
hand, the orientation quadrants of the 
elliptical and irregular shapes whose 
mean reversal frequencies were 24 and 
17, respectively, were correctly judged.? 
These data suggest that misperception 
of orientation is not a necessary condi- 
tion for the occurrence of apparent 
reversals of direction in rotary motion. 


Experiment 3: Relationships between 
Apparent Orientation, Initial Move- 
ment Direction, and Reversals 


The purpose of the third experiment 
(Power, 1964) was to investigate un- 


Seat ite order in which they occur in 

queia, , the mean apparent reversal fre- 

rene s of the two irregular shapes which 

be 17 ‘ot used in Experiment 1 were found to 
and 18 in a separate experiment. 
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der the same conditions the relation- 
ships between judgments of orientation, 
of initial direction of movement, and of 
reversals. The second of these (initial 
direction of movement) was included 
since Graham (1963) and Pastore 
(1952) argue that, in the case of the 
trapezoidal window, apparent orienta- 
tion determines perceived direction of 
movement. 

The apparatus and viewing condi- 
tions were similar to those of Experi- . 
ment 2. The standard shapes were a 
trapezoidal window, an ellipse, an ir- 
regular shape, and a rectangular win- 
dow all of which had been used in 
Experiment 1. For judgments of 
orientation the four shapes were placed 
in the subject's frontoparallel plane and 
the two asymmetrical objects (trape- 
zoid and irregular) were positioned 
with one edge both left and right. 
Thus there were six conditions. Inde- 
dependent groups of 10 subjects drawn 
from introductory classes in psychology 
observed one shape during all three 
phases. In the first phase the subject 
adjusted the lower latticed rectangle so 
that it appeared in the same plane as 
the upper standard. Immediately this 
match was made the standard was 
rotated anticlockwise at 4.5 rpm and 
the subject indicated apparent direction 
of movement (Phase 2). In the third 
phase the subject indicated apparent re- 
versals during a 20-revolution trial as 
in Experiment 1. 

The results are shown in Table 1 for 
the four shapes and the two positions 
of the trapezoidal and irregular shapes. 

Pastore's (1952) explanation states 
that, when the longer vertical edge of 
the trapezoid is judged nearer when 
in fact it is farther, rotation direction 
appears opposite to true direction. 
Judgments of initial motion direction 
immediately following orientation judg- 
ments (Table 1, fifth row) show that 
when the longer edge of the trapezoid 
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E 


í Y {р REVERSAL 
MEANS AND STANDARD DEVIATIONS OF ORIENTATION MATCHES AND REVERSA 


FREQUENCIES WITH NUMBER OF 
MovEMENT as Clockwise (С) 


SUBJECTS JUDGING INITIAL 
AND ANTICLOCKWISE (AC): 


EXPERIMENT 3 


Shapes 
Trape- 
* Trapezoidal ^ zoidal 
Irregular Irregular window window Rectan- 
(protub- —(protub- uot sheet gular 
Tight) ЧП AS leto Ellipse window 
Orientation bd 94.6 91.1 320.9 38.3 103.5 924 
Degrees from _ Р 
bois SD $06 — 5.6 — 103 1266 3235 3.67 
plane) 
Reversal frequency — X 164 220 289 255 23.9 е i 
SD 9.1 12.65 9.33 7.17 8.07 е 
Initial movement € 4 5 0 10 2 1 
direction AC 6 5 10 0 8 9 


was judged nearer on the left, all judg- 
ments of direction were correct (ie, 
anticlockwise), When, however, the 
longer edge was judged nearer on the 
right, all judgments of rotation direc- 
tion were incorrect (clockwise). Thus, 
Pastore’s (1952) statement is valid for 
the trapezoidal window. The phi co- 
efficients between judged quadrants of 
orientation and judged initial direction 
of motion (clockwise or anticlockwise) 
have been computed for the four 
shapes, the two positions of the irregu- 
lar and trapezoidal shapes being com- 
bined for this purpose. These coeffi- 
cients are .297 for the irregular shape, 
.900 for the trapezoid, .234 for the 
ellipse, and .250 for the rectangle. The 
relationship between j udged orientation 
and judgments of initial direction of 
motion achieves significance only in 
the case of the trapezoidal window and 
for all other shapes fails to reach sig- 
nificance. Thus, whereas this rela- 
tionship holds for one shape ( trapezoid) 
which manifests 29 and 25 apparent 
reversals during rotation, it does not 
do so for two other shapes which mani- 


2 
fest reversal frequencies of 16 and 2 
(irregular) апа 24 (ellipse). 


Summary of Experimental Data 


The data from the first experimen! 
show that the apparent reversal no 
quency of a trapezoidal window 18 jes 
different from the reversal freguena а. 
of the same shape with alternative pe 
face patterns. "These data show alar 
that, with the exception of rectangt x- 
and circular Shapes, other shapes pe 
hibit apparent reversals with = 7 
the same frequency as a inae 
The results from Experiment 2 bear 
Strate that, whereas the quadrants ^, 
orientation of a trapezoidal window * e 
misjudged when the short vertical € 
is near, the orientation quadrants | 
other reversing shapes are not. Has 
third experiment no evidence een 
found for a causal relationship betw гё 
apparent orientation and apparent 17 
versals, nor was there evidence e 
relationship between apparent ени 
tion and initial apparent direct? jaa! 
movement. Although the trapez 


Я E v Р jenta 
window is misjudged as to its or! 
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tion and also exhibits apparent re- 
versals during rotation, the former is 
not necessary for the latter with the 
other shapes. In brief, the hypothesis 
that apparent orientation deriving from 
Perspective cues is necessary for ap- 
parent reversals has not been sustained. 


Ам ALTERNATIVE EXPLANATION 


In this and the following section a 
theory of apparent reversals of rotary 
motion in depth will be proposed. It 
Will be further argued that the theory 
applies also to cases of reversal in static 
figures and objects. 

When a plane shape rotates in depth 
relative to an observer, the retinal pro- 
Jection of its motion is a repeated ex- 
Pansion and contraction as the object 
moves from the frontoparallel to the 
Sagittal plane of the observer. If the 
retina is considered as a plane surface 
then the motion of projected points 
follows a sine function. The retinal 
Projection expands rapidly as the ob- 
Ject moves from the sagittal plane 
(“edge-on” position) and slows, and 
finally ceases as it approaches and 
reaches the frontoparallel ("full-on" 
Position). 

Now, when the shape rotates clock- 
Wise from the sagittal to the fronto- 
Parallel plane, the retinal projection ex- 
Pands from its minimum width deter- 
mined by the thickness of the object, 
to its maximum width determined by 
Its horizontal extent. But when the 
Shape moves anticlockwise between 
these planes an identical expansion of 
its retinal projection occurs. That is, 
there is an identity of retinal motion 
characteristics for clockwise and anti- 
Clockwise rotation. The same applies 
to the contraction phase as the object 
Moves from the frontoparallel to the 
Sagittal plane. 

oo (1963) has shown ina de- 
tial analysis the identity of differen- 
angular velocities for points on the 
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surface of a rotating plane during the 
approach and recession phases of ro- 
tary movement. It is clear from this 
treatment that the subject cannot judge 
whether these points are approaching 
in a near quadrant or receding in a far 
quadrant, since the same sign and 
quantity of differential angular velocity 
apply in each. 

Apparent reversals in rotation direc- 
tion can be explained in the following 
terms. Since there is an identity of 
retinal motion for the two directions of 
rotation, then, in the absence of cues to 
direction in depth such as retinal dis- 
parity, the object will occasionally be 
judged as moving clockwise and occa- 
sionally as rotating anticlockwise. 

The data from these experiments in- 
dicate that the identity of retinal motion 
characteristics for approach and reces- 
sion of the rotating shape is a necessary 
condition for the occurrence of appar- 
ent reversals. When this condition ob- 
tains, therefore, it would be expected 
that judgments of approach or of re- 
cession would be equally probable. 
That is, when the retinal projection of 
the shape changes from expansion to 
contraction (and vice versa), the sub- 
ject will judge it as either approaching 
or receding, and there is no reason to 
suppose that other than chance factors 
determine this judgment. In support 
of this is the observation that the rotat- 
ing shapes are not reported as revers- 
ing during every revolution. Reversals 
are reported along with complete revo- 
lutions and, as will be seen below 
(Experiment 4), about half the pos- 
sible number of apparent reversals oc- 
cur during a trial. Although the retinal 
identity of clockwise and anticlockwise 
motion is clearly recognized by Graham 
(1963), he nevertheless attributes the 
apparent reversing effect to apparent 
orientation determined by cues to per- 
spective. It is pertinent, however, to 
raise the issue concerning this rela- 
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tionship between apparent orientation 
in depth and apparent reversals. It 
the object appears to rotate, say, anti- 
clockwise, then it would be expected 
that apparent orientation at any one 
moment would be in accord with that 
direction and vice versa. That is, ap- 
parent orientation can be regarded as 
a consequence rather than as a cause 
of apparent direction of movement. 
The data from Experiments 2 and 3 
are convincing in showing that appar- 
ent orientation does not determine 
either reversals or initial direction of 
movement. This relationship is further 
discussed below. 

The principle involved in this ex- 
planation is simply that an identity of 
retinal motion for two directions of 
rotation leads to an ambiguity re- 
sulting in two possible judgments of 
direction. It would be expected, how- 
ever, that if cues to motion direction 
were available, then apparent reversals 
would either be reduced in frequency 
or eliminated. One such cue is that 
provided by retinal disparity which 
gives information about the true orien- 
tation of the object in depth from mo- 
ment to moment. Apparent reversals 
are not reported with binocular viewing 
unless the viewing distance is greater 
than that within which retinal dis- 
parity is normally operative (Ames, 
1951). 

Further cues to motion direction are 
probably provided by certain shapes 
which exhibit few or no reversals, Al- 
though the role of these cues has yet to 
be confirmed, it is reasonable to discuss 
their possible modes of operation here. 
If, as with rectangles, at some stage 
during rotation the vertical edges pro- 
ject equal visual angles, then decrease 
of one angle with recession, and in- 
crease of the other with approach, 
could provide directional cues. That 
is, equality of retinal projections when 
frontoparallel provides information for 
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direction discrimination ; information 
not available when the two vertical 
edges are unequal so that at no stage 
of rotation are “base-line” conditions 
for comparison available. Support for 
the role of this cue is provided by Mul- 
holland’s (1956) data showing that ap- 
parent reversals occur predominantly 
with asymmetrical objects. It is not 
possible, however, to speculate with 
conviction on the directional cues Te- 
sponsible for the reduced frequency 0 
reversal of the circular shape ( Expert 
ment 1). Although this shape reverse 
with a relatively high frequency, it di 
so less than the trapezoidal, elliptical, 
and irregular shapes, 


ANGLES OF APPARENT REVERSAL 


ssl 
The theory of apparent revers? 


which has been set out here states that 
the effect derives from an identity e 
horizontal retinal motion projection? 
for the approach and recession of е 
object rotating in depth. It ap 
that if an edge or point is judged : 
receding when in fact it is approaching 
then the orientation of the object WP 
also be misjudged. That is, the ар 
parent orientation of the object Wi" 
be determined by the apparent e 
tion of motion, "Thus, the theory !5 zi 
this regard opposite to those of giae 
(1951), Graham (1963), and pasior 
(1952) insofar as apparent orientatio 
is held to be a consequence of, oa 
than a necessary condition for, арра 

ent reversals. In brief, it is conten p 
that whichever direction of кана а 
judged, both being equally proba es 
then orientation will be judged ? 

cordingly. 

It also follows that apparent 
als would be expected to occu 
the object is nearly frontoparalle ses 
nearly sagittal. As the object Pane 
through the frontoparallel the Te", 
projection commences to deeresm, se 
horizontal extent, and as it P? 


revers- 
r whe” 
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through the sagittal plane retinal ex- 
pansion occurs. At these points of 
change an observer can make one of 
two judgments of rotation direction. 
At all other orientations reversals 
would not be expected since these 
would necessarily result in abrupt 
changes in apparent orientation. For 
example, if the right edge of a shape 
15 judged as approaching when about 
45 degrees from the frontoparallel, an 
apparent reversal would require a sud- 
den jump of the edge to 135 degrees 
(the opposite orientation). If a re- 
versal were to occur at this point with- 
Out such an abrupt reorientation, then 
the object would necessarily appear to 
expand as the edge receded. There is 
no evidence for such sudden changes 
11 apparent orientation. Thus, as the 
object passes through the frontoparallel 
and sagittal planes of the observer, 
changing phase from expansion to con- 
traction, apparent direction of rotation 
may be judged as clockwise or anti- 
clockwise. 


Experiment 4: Angles of Orientation 
at Which Apparent Reversals Occur 


The fourth experiment was con- 
ducted to test the prediction that ap- 
Parent reversals occur when the object 

In either the sagittal or frontoparal- 
el planes of the observer. 

i Using the same apparatus and view- 
sipa ditions as in Experiment 1, the 
еШ" were required to press the 
©те when apparent reversals oc- 
ton Дет These responses were re- 
corde on a constant speed paper re- 
оосар, -© Also recorded were the 
Was oa when one edge of the shape 
Bree) Sar and exactly sagittal (0 de- 
latter gi The distance between these 
and jt к represented 360 degrees, 
lish t е as therefore possible to estab- 
at Which eet positions of the shapes 
res apparent reversals occurred. 
groups of five subjects cach 
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Fic. 3. Mean angles of apparent reversal 
for three shapes (trapezoidal window, ellipse, 
and irregular) rotating clockwise (dotted 
and dashed lines) and anticlockwise (dotted 
lines). (The mean reversal angles for the 
two directions of rotation combined are also 
shown, continuous lines. The apparent re- 
versal angles are shown separately for those 
occurring near 90 degrees, upper row, and 
near 270 degrees, lower row.) 


underwent 10 10-revolution trials, 5 in 
which rotation was clockwise and the 
other 5 in which it was anticlock- 
wise. One group observed the ellipse, 
another the trapezoidal window, and 
the third the irregular shape. All 
shapes were those used in Experiment 
1, and the viewing and responding con- 
ditions were also the same. 

The mean angles of reversal are 
shown in Figure 3 for clockwise and 
anticlockwise rotation and for the two 
directions taken together. It can be 
observed that the mean reversal angles 
for the two directions combined ap- 
proximate closely to the true fronto- 
parallel (90 and 270 degrees), while 
those for the two directions considered 
occur beyond and before the 


singly ] 
frontoparallel. Of the 15 subjects, 
there were only 2 who signaled Je 
versals near the sagittal position. 


These are not shown in Figure 3: 

In view of the ambiguity of the stim- 
ulus conditions and the minimum ve- 
locity of the retinal projection when the 
object is frontoparallel, it is reasonable 
to attribute the difference between ap- 
parent reversal angle and frontoparallel 
for the clockwise and counterclockwise 
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The principle of belief congruence; previously presented in Rokeach 
(1960), is elaborated here so that it can be applied to instances of 
cognitive interaction. The belief congruence model is compared and 
contrasted with Osgood and Tannenbaum's congruity principle, the 
main difference between the 2 principles being that the former asserts 
that the outcome of cognitive interaction cannot be accurately 
predicted from a knowledge of the evaluative meaning of the 2 ele- 
ments judged in isolation, while the latter asserts that it can. Quanti- 
tative predictions generated by the 2 models are compared with 
obtained results. It is found that the average absolute error of the 
congruity model is 1.07 and the average absolute error for the belief 
congruence model is .34, thus suggesting that the latter model's 
predictions are approximately 3 times as accurate as the congruity 


model’s predictions. 


The main purpose of this paper is 
to elaborate further the principle of be- 
lief congruence (Rokeach, 1960) so 
that it will apply to instances of cogni- 
tive interaction. A second purpose is 
to compare and contrast the predictions 
generated by this principle with those 
generated by Osgood and Tannen- 
baum’s (1955) congruity principle. It 
will be suggested that the two prin- 
ciples cannot be equally valid since they 
appear to lead in many instances to 
contradictory predictions. A third 
purpose is to present the results of 
empirical work designed to determine 
which set of predictions is the more 
accurate, and a fourth purpose is to 
try to reconcile the data and interpre- 


1 Тһе research described herein was sup- 
ported by a grant from the National Science 
Foundation, and an early draft of this paper 
was prepared by the senior author at the 
Center for Advanced Study in the Behavioral 
Sciences. The final version of this paper 
was written following a stimulating exchange 
of views with Charles E. Osgood. We are 
grateful to Osgood for his criticisms, but 
this acknowledgement should not be taken to 
mean that he necessarily concurs with the 
formulations and conclusions stated herein. 


tations presented by Osgood and his 
co-workers with those in our own 16 
search program. 


THE PRINCIPLE or BELIEF 
CONGRUENCE 


The principle of belief congruent? 
asserts that we tend to value a B 
belief, subsystem, or system of beliel 
in proportion to their degree of congr! 
ence with our own belief system an^" 
further, that we tend to value peop е 
in proportion to the degree to which 
they exhibit beliefs, subsystems, or 5X? 
tems of belief congruent with our ока 
Congruence can be defined both а 
terms of similarity and importane” 
Given two beliefs or subsystems О 
lief equal in importance, the one mO?” 
similar to our own is the more pet 
gruent; conversely, given two beat 
or subsystems perceived to be edu? zt 
similar to our own, the one judge nt 
more important is the more congrue 
with our own belief system. ap ciple 

Before discussing how the фе 
of belief congruence applies to the ek 
of cognitive interaction, it is first n^ 
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sary to assume that any stimulus (a 
verbal concept, percept, or event) has 
the property of activating within a 
person only a portion of his belief 
System—that portion which is relevant 
to or associated with the stimulus. 
Which particular portion and how 
broad a portion of the belief system is 
activated depends on the particular 
stimulus presented. A particular stim- 
ulus may activate only a single belief 
or a whole subsystem of beliefs varying 
in breadth or in the number of inter- 
related component beliefs. KHRUSH- 
CHEV or COMMUNISM, for instance, 
may activate a broader area of the 
person's belief system than IVAN 
PETROV Or COLLECTIVE FARM. 

Moreover, a person should typically 
have no difficulty, when presented with 
an isolated stimulus, in assessing the 
direction and importance of the beliefs 
1 activates along any given scale (say, 
a semantic differential scale) because 
his previously learned belief system 
should provide him with a generalized 
frame of reference for judging direc- 
tion and importance. 

Consider now what may reasonably 
be expected to happen when one stimu- 
lus is associated with another through 
Some assertion. The linkage is as- 
Sumed to give rise within the person 
to a unique configuration activating 
Comparison processes designed to de- 
termine whether or not there is to be 
Cognitive interaction and, if so, its out- 
Come. An assertion, following Osgood, 
тау be associative or dissociative and 
el take one of four forms: simple 
Inguistic qualification (LAZY ATH- 
LETE), simple perceptual contiguity 
id advertisement showing a MOVIE 
ae M CAMELS), statements of 
i cation (JONES is a PHYSICIST), 
а Source-object assertions „(ое 
intera Doses TEST BAN). Cognitive 
tigen ion refers to the process by 

of which a single evaluative 
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meaning emerges as a result of combin- 
ing two stimuli, each having their sep- 
arate meanings. 

All the four types of assertions men- 
tioned above have something in com- 
mon: they are unique configurations 
cognitively representing a character- 
ized subject (CS)—a person, thing, or 
idea characterized or qualified in some 
unique way. The unique configuration 
consists of two components: a subject 
(S), capable of being characterized in 
many ways, and a characterization 
(C), capable of being applied to many 
subjects. 

Let us now restate the principle of 
belief congruence in order to apply it 
to all conditions which might and might 
not lead to cognitive interaction and at 
the same time to quantitatively predict 
its outcome. Whenever two stimuli 
are brought into association with one 
another through an assertion they form 
a unique configuration activating two 
kinds of comparison processes: the 
stimuli will first be compared for mu- 
tual relevance, and if they are perceived 
to be at least partially relevant for one 
another, they will then be compared 
for relative importance. 


Comparison of Relevance 


Upon presentation of CS a person 
will first ascertain whether or not the 
two components, C and S, are relevant 
for one another. If he judges them to 
be not relevant, there are no psycho- 
logical grounds for expecting cognitive 
interaction. The irrelevant component 
should be ignored ; that is, it should 
exert no influence on the evaluation of 
CS. Consequently, CS should be 
ated in about the same way the 


evalu : 
ponent is evaluated. 


remaining com 
Comparison of Relative Importance 
More interesting for psychological 
theory and research are those instances 
wherein a person judges C and S to 
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be at least partially relevant for one 
another. 

Relative Importance of C and S. 
Under the condition of at least partial 
relevance, a person will next compare 
the two components for relative im- 
portance with respect to another, 
judged within the general frame of 
reference provided by one’s previously 
learned belief system. And, we con- 
jecture, the evaluation of the configura- 
tion CS should be a simple average of 
the evaluations of C and S considered 
separately, weighted by the perceived 
importance of C and S relative to one 
another within the context CS: 


dos = (w)do + (1—w)ds [1] 


where des, do, and d; refer, respec- 
tively, to the degree of polarization 
(positive or negative) of the charac- 
terized subject, the characterization, 
and the subject, and where (w) and 
(1— w) refer to the perceived im- 
portance of do and dg relative to one 
another in the context CS. 
Relative Importance of CS and C. 
When C reaches 100% in importance 
(and S 0%) we must posit the activa- 
tion of yet an additional comparison 
process over and above a comparison 
of C and S, namely, a further compari- 
son of the relative importance of CS 
and C. Suppose a person perceives 
IRRESPONSIBLE as all-important in the 
Context IRRESPONSIBLE FATHER. In 
such a case, it might be reasonable at 
first glance to assume from Formula 1 
that a person's evaluation of CS would 
be completely determined by his evalu- 
ation of С; that is, he evaluates CS the 
same way he evaluates C. Such an 
assumption, however, overlooks the 
possibility that by virtue of the inter- 
action between C and S within the 
framework of one's total belief system 
a person's evaluation of CS may he 
even more extreme than his evaluation 
of C. He may feel strongly negative 
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toward IRRESPONSIBLE but even more 
strongly negative toward IRRESPONSI- 
BLE FATHER because he may feel that 
fathers especially ought not to be ir- 
responsible. In other words, he may 
evaluate CS as falling outside the limits 
of C and S. A characteristic which is 
negatively evaluated in isolation may 
be judged even more negatively when 
it is lodged in a positive subject; con- 
versely, a characteristic which is posi- 
tively evaluated in isolation may be 
judged even more positively when it 1$ 
lodged in a negative subject.? 

The combined effects of the two com- 
parison processes (C versus S, and CS 
versus C) on the evaluation of CS un- 
der the condition described may be 
quantitatively expressed as follows: 


dos = do + (v)do [2] 


where de represents the effect of the 
first comparison process (C versus 2. 
and where (v)d, represents the addi- 
tional effect of the second comparison 
process (CS versus C), and where " 
represents the extent to which the pe! 


2 We are dealing here with a comparison 
process sometimes referred to as overassim- 
lation, a process whereby a stimulus nO 
only takes on the valence of another stimulus 
With which it is associated, but in addition 
takes on an even stronger valence. тыз 
Phenomenon can often be observed in every” 
day life: (a) A Jew who converts to Chris 
tianity is sometimes regarded as БЕ 
“worse” than a Goy; (b) a positively eva ГА 
ated person who defects to the other side р 
even more severely condemned than а n 
tively evaluated person on the other Sfi 
(attitude toward the renegade, Roker” C 
1960); (c) a negatively evaluated pers 
who defects to our side is even more yim 
embraced than those already on our side. m- 

All the above instances involve a due 
parison between CS and C. For the SM 
completeness the converse possibility sho on 
also be mentioned, namely, a compare" 
process involving the relative important igg 
CS and S when S is perceived to be es 
important. On intuitive grounds, Ho io" 
this does not appear to us to be psy по! 
logically meaningful and will therefore 
be given further consideration here. 
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son attaches greater importance to 
CS than he does to C. When the 
person judges CS to equal C in im- 
portance, v = 0 and des = de. When 
the person judges CS to exceed C in 
Importance, v will equal some coeffi- 
cient expressing the extent to which 
CS is perceived to exceed C in impor- 
tance, and dos will exceed йс by the 
amount (v)de. When dg is positive, 
dcs will be more positive, and when dc 
is negative des will be more negative. 

One important restriction should be 
noted here. The value of des cannot 
be allowed to exceed the most extreme 
Score on whatever happens to be the 
Scale of measurement employed in em- 
pirical research (e.g, +3 on a scale 
ranging from —3 to +3). Conse- 
quently, de + (v)dc cannot be allowed 
to exceed this extreme score either. 
If do, for example, already equals the 
most extreme score then the sum of dc 
апі (о) is arbitrarily assigned the 
Same score. 


CONTRAST BETWEEN THE BELIEF CON- 
GRUENCE AND CoNGRUITY PRINCIPLES 


The formulation of the principle of 
belief congruence differs from Osgood 
and Tannenbaum’s congruity principle 
1n several important respects :? 


: l. The congruity principle is an ad- 
ditive model which predicts the out- 
S of cognitive interaction solely 
пе a knowledge of the direction and 

gree of polarization of the two 
аш considered in isolation. In con- 
‘rast, the principle of belief congruence 
i a configurationist model asserting 

at the comparison processes cannot 
woe activated until the two stimuli 
are linked together to form a unique 
РЫ... most detailed exposition of the 
oH Bruity principle will be found in the 

ral article by Osgood and Tannenbaum 
good For further elaborations, see Os- 
Oa Suci, and Tannenbaum (1957) and 
the et (1960). An excellent summary of 

work is given by Brown (1962). 


gestalt (Asch, 1946, 1952) and, conse- 
quently, the outcome of the cognitive 
interaction cannot be accurately pre- 
dicted solely from a knowledge of the 
direction and intensity of the two 
stimuli considered separately. 

This is not to say, however, that the 
evaluative meaning of the unique con- 
figuration, CS, is unpredictable. As 
already noted, CS is evaluated against 
the general background of one’s total 
belief system. Elsewhere, Rokeach 
(1960, 1964, 1965) has attempted to 
describe various types of beliefs within 
the belief system and to locate them 
along a central-peripheral dimension. 
These formulations provide us with a 
basis for predicting, at least roughly, 
the evaluative meaning of any given 
CS: they guide educated guesses about 
the relative location of beliefs activated 
by C versus S, and by CS versus C, 
along a central-peripheral dimension. 

2. The two models differ in their 
conception about the psychological 
meaning of incongruity. Exactly what 
is incongruous with what? According 
to Osgood and Tannenbaum (1955) 
the incongruity is between the C and 
the S, and the greater the disparity 
between C and S the greater the incon- 
gruity and the greater the pressure to 
reduce it when the assertion is positive ; 
the reverse holds when the assertion is 
negative. Reduction of incongruity be- 
tween C and S is achieved by a com- 
promise in which C and S both change 
toward or away from one another in 
inverse proportion to their respective 
degrees of polarization.* 

а Тһе formula Osgood, Suci, and Tannen- 
baum (1957) use to predict the outcome of 
congruitive interaction is: 


Ide] dc) Lo l ds) 
des = Тар] + [asl ^ + Tae] + Tast 


“where |d| is deviation or polarization from 
neutrality on the scales regardless of sign, 
d is deviation from neutrality with respect to 


sign [p. 278].” 
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The psychological meaning of incon- 
gruity is formulated somewhat differ- 
ently within the  belief-congruence 
framework. The incongruity arises 
not from the psychological disparity 
between C and S but from the disparity 
between C and CS, or between $ and 
CS, or both. The evaluative meaning 
of CS must somehow be made maxi- 
mally congruent with one's belief sys- 
tem, which includes within it pre- 
viously learned evaluative meanings 
of C and 5. Since CS cannot be com- 
pletely congruent with both C and S 
(assuming some discrepancy in the 
evaluative meaning of C and S), the 
question arises about the exact process 
whereby the evaluative meaning of CS 
will become maximally congruent with 
C and S. The Principle of belief con- 
gruence attempts to describe how this 
outcome comes about. The more dis- 
crepant the relative importance of C 
and S with respect to one another the 
greater the pressure to evaluate CS, 
positively or negatively, like C or like 
5, whichever is the more important, but 
not both. The more C equals S in im- 
portance, the more equalized the pres- 
sure from C and S. 

3. The congruity principle is essen- 
tially a compromise model, the only 
exception being the following : when 
an extremely polarized stimulus is posi- 
tively associated with a neutral stimu- 
lus the meaning of the combined 
stimuli is assimilated to that of the 
extremely polarized stimulus. In con- 
trast, the principle of belief congruence 
allows for various degrees of compro- 
mise, and assimilation and over- 
assimilation depending on the relative 
importance of C versus S and SF € 
versus CS, in the context CS, regard- 
less of the degree of polarization of С 
and S, considered separately. 

4. The congruity model formally 
posits an assertion constant in the case 
of source-object assertions: there is a 
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greater force acting on the object than 
on the subject; that is, according to 
Brown (1962), “the object of the bond 
will be more affected than the source of 
the bond [p. 26].” In contrast, the 
Principle of belief congruence denies 
the necessity of such an assertion con- 
stant and states instead that the magni- 
tude of the force acting on the source 
and object depends on the relative im- 
portance of the beliefs activated by 
Source and object. For example, in 
the assertions KENNEDY praises GOP 
and сор praises KENNEDY, the congrtt- 
ity principle’s assertion constant shoul 1 
lead to the expectation that there wil 
he a greater average change of value 
on the two objects than on the a 
sources. In line with the principle 0 
belief congruence we would expect 11" 
stead that the less important compo- 
nent, whether source or object, woul 

be the more affected. 


A TEST or тне Two PRINCIPLES 


Before presenting the results of em- 
Pirical work specifically designed tO 
test the contradictory predictions gen 
crated by the two principles, attenter 
should first be drawn to a relevan 
and substantial body of data already 
made available by Rokeach, Smith, an 
Evans (1960), by Byrne and hp 
(1962), and by Stein, Hardyck, ап 
Smith (1965), The first study cite 
clearly shows that white subjects in the 
North and the South rate incongruous 
race-belief Configurations such 2? 
Negro who believes in God and white 
person who is an atheist not by com; 
Promise, as the congruity model wou и 
Predict, but by assimilation, the evalua” 
tion of the total configuration CS er 
more or less completely assimilated e 
that of C—the belief said to cia 
terize the Negro or white. Simi J- 
results were obtained for Jewish a 
dren when they were presented уза 
congruous and incongruous configu’ 
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tions depicting Jews and Gentiles hold- 
ing various beliefs. The Jewish chil- 
dren positively evaluate Gentiles who 
agree with their views (e.g., Gentile 
who is for Israel) and negatively 
evaluate Jews who disagree with their 
views (e.g., Jew who is against Israel). 
The total configurations (CS) are 
evaluated in essentially the same way 
the belief is evaluated in isolation (C) 
and, furthermore, regardless of whether 
the person holding the belief is a Jew 
or Gentile (S). Similarly, Byrne and 
Wong (1962) and Stein, Hardyck, and 
Smith (1965) presented fictitious per- 
sonality profiles of whites and Negroes 
to their white subjects, some profiles 
being similar to and some different 
from the subjects’ own profiles. The 
subjects evaluated positively those 
characterized as having personality 
profiles similar to their own and evalu- 
ated negatively those characterized as 
having profiles different from their 
own, regardless of whether the profile 
belonged to a white or Negro. 

We may tentatively conclude from 
all the preceding that the subjects are 
responding in accord with the principle 
of belief congruence, which predicts as- 
similation whenever extremely im- 
portant characteristics (beliefs ог 
traits) are associated with positively 
and negatively valued people and not 
in accord with the congruity principle, 
which predicts compromise. But these 
Studies do not provide us with a defini- 
tive test of the two principles because 
they employed research designs some- 
what different from those of Osgood 
and his co-workers, and they did not 
employ the semantic differential. We 
therefore designed a new study in- 
volving the semantic differential which 
replicated Osgood and  Ferguson's 
word combination study (Osgood et al., 
1957, p. 275) except in one significant 
respect: we deliberately tried to select 
Various concepts which, when presented 
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separately, would activate more or less 
highly polarized beliefs, thus ensuring 
that the predictions regarding the out- 
come of cognitive interaction by the 
principle of belief congruence would be 
diametrically opposed to those made by 
the congruity principle. As in the Os- 
good and Ferguson study, we selected 
components whose combinations were 
assumed to be credulous, and the pur- 
pose of the study was the same as 
their’s, namely, the prediction of the 
evaluative meaning of word combi- 
nations.® 

Forty-two white subjects, enrolled 
in an introductory psychology course at 
Michigan State University in the sum- 
mer of 1961, rated 22 component con- 
cepts and 12 combinations shown in 
Tables 1 and 2 on three semantic dif- 
ferential scales representing evaluation 
(valuable-worthless; admirable-deplor- 
able; good-bad). We used three types 
of assertions: simple linguistic qualifi- 
cation, statements of classification, and 
source-object assertions. The subjects 
first rated the 22 individual concepts 
on each of the three scales, and then 
rated 12 assertions linking 2 con- 
cepts. For the exact procedure see 
Osgood, Suci, and Tannenbaum (1957, 
p. 275). Predicted evaluation scores 
ranging from 1 to 7, which is, of 
course, equivalent to a range from —3 
to +3, were calculated for each subject 
separately on the basis of the con- 
gruity formula (see Footnote 4) and, 
then, the mean predicted (from the 
congruity model) and obtained scale 
scores were calculated for all subjects. 

Table 1 shows that the obtained 
means are reasonably close to the con- 
gruity model's predicted means in only 
4 out of the 12 combinations (NEGRO 


5 Actually, we obtained data on potency 
ctivity as well as on evaluation. Since 
the evaluative dimension is of major concern 
to attitude theory and research, only the 
evaluation data will be reported here. 


and a 
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TABLE 1 


MEAN ERROR OF PREDICTIONS AND SIGNIFICANCE TESTS OF DIFFERENCES BETWEEN MEAN 
OBTAINED EVALUATION SCORES FOR COMBINED CONCEPT AND MEANS PREDICTED BY THE 
i CONGRUITY MODEL AND THE BELIEF CONGRUENCE MODEL 


Mean 
(hssertion. M SD error t 
1. A WHITE PERSON Predicted: congruity 3.75 14 15 moge 
" i 2 5 
who asa Obtained 2.06 1.05 a gp 
COMMUNIST Predicted: belief congruence 2.50 -65 
2. A WHITE PERSON Predicted: congruity 4.43 1.43 " 
Я 1.51 7.40** 
dios Obtained 2.92 1.46 
who is an жї 2.22* 
ATHEIST Predicted: belief congruence 3,43 A3 
3. A NEGRO Predicted : congruity 6.32 97 
10 .83 
who believes Obtained 6.42 .85 
.07 51 
in Gop Predicted: belief congruence 6.35 .28 
4. A NEGRO Predicted: congruity 5.96 98 
.07 .50 
who is an Obtained 5.80 124 
.02 10 
ANTICOMMUNIST Predicted: belief congruence 5.91 12 
5. MY MOTHER Predicted: congruity 4.42 4 
2.38  13.84"** 
IS INSINCERE Obtained 2.04 .96 
.42 1.62 
Predicted: belief congruence 2.46 1.60 
6. UNIVERSITY PROFESSOR Predicted: congruity 4.37 140 
147 5.90*** 
favors EXTRAMARITAL Obtained 2.90 1.56 
.01 .04 
SEXUAL RELATIONS Predicted: belief congruence 2.89 75 
7. CLARK GABLE Predicted: congruity 3.56 121 " 
4 4" 
was in favor of Obtained 2.62 123 
da -50 
FIDEL CASTRO Predicted: belief congruence 2.73 -84 
8. DISHONEST Predicted: congruity 3.36 .88 н 
1.56 9.28 
ATHLETE Obtained 1.80 85 
26 149 
Predicted: belief congruence 2.06 85 


че 


MopeEts or COGNITIVE INTERACTION 1 


[20] 
on 


TABLE 1—Continued 


Mean 


Assertion M SD error t 
9. UNFAITHFUL Predicted: congruity 3:73 17 
1.60 8.54*** 
ROMANCE Obtained 213 1.19 
å а 39 1.63 
Predicted: belief congruence 2.52 1.11 
10. NIKITA KHRUSHCHEV Predicted: congruity 5.25 1.09 
| AT NE 
advocates CLOSE Obtained 5.08 1.57 
а | 46 1.64 
FAMILY TIES Predicted: belief congruence 5.54 1.12 
11. nussiA extends Predicted: congruity 5.66 1.10 
А 24 .80 
FREEDOM OF Obtained 1.70 
. я 13 А5 
THE PRESS Predicted: belief congruence .88 
12. A PROSTITUTE Predicted: congruity 3.90 1.35 
1.05 5.97*** 
2.85 1.58 


who looks like Obtained 


GRACE KELLY 


Predicted: belief congruence 4.10 1.53 


a £ tests for correlated measures between obtained and predicted (congruity) scores. 
eans predicted by the belief congruence model were corrected for 


b All t's between obtained means and mi 
heterogeneity of variance, whenever necessary (E 
.05. 


жер < 1001. 


who believes in сор; NEGRO who is an 
ANTICOMMUNIST; NIKITA KHRUSH- 
CHEV advocates CLOSE FAMILY TIES; 
RUSSIA extends FREEDOM OF THE 
PRESS). Statistical tests, shown in 
Table 1, indicate that the obtained and 
predicted means for these four asser- 
tions are not significantly different 
from one another, results required by 
the congruity model. On the other 
eight assertions, however, the obtained 
means deviate markedly from the con- 
gruity model's predicted means, and in 
all eight instances are significantly dif- 
ferent from one another at the .001 
level, 

It will be further noted in Table 2 
that in the majority of the assertions 
the evaluative meaning of CS deviates 
markedly from that of S and adheres 
closely to that of C. In 6 out of the 


Edwards, 1960, pp. 106-108). 


12 assertions (Assertions 2, 3, 4, 5, 6, 
7) CS does not differ significantly from 
C. In one instance (WHITE PERSON 
who is a соммоміѕт), CS is even 
more negatively evaluated than C, and 
significantly so—a_clear instance of 
overassimilation. Finally, in Asser- 
tions 8 and 9, the evaluative meaning 
of CS is almost but not quite com- 
ated to C. When we con- 
sider the results of Assertions 1 
through 9 all together, it is seen that 
the obtained results for CS are gen- 
erally poorly predicted by the con- 
gruity principle. It is only in the 

с In only two of these nine assertions—the 
two Negro configurations—do the obtained 
means differ insignificantly from the congru- 
ity model's predictions (Table 1). The 
closeness of predictions in these two instances 


may be spurious; the disparity between 5 
and C is very small and, consequently, it is 


pletely assimil 
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TABLE 2 


MEAN DIFFERENCES BETWEEN THE EVALUATION OF THE COMBINED 
CONFIGURATION AND EACH OF THE Two COMPONENTS 


Mean 
M difference t 
1. WHITE PERSON 5.35 
3.29 14.95*** 
A WHITE PERSON who is à COMMUNIST 2.06 
39 2.60* 
COMMUNIST 245 
2. WHITE PERSON 5.35 
242 8.96*** 
A WHITE PERSON who is an ATHEIST 2.92 
232 1.78 
ATHEIST 3.24 
3. NEGRO 5.38 
1.04 6.50*** 
A NEGRO who believes in Gop 6,42 
.10 ‚91 
GOD 6.52 
4. NEGRO 5.38 
51 2.83** 
A NEGRO who is an ANTICOMMUNIST 5.89 
.07 .39 
ANTICOMMUNIST 5.96 
5. MY MOTHER 6.51 
347 19.60*** 
MY MOTHER is INSINCERE 2.04 
A8 1.38 
INSINCE 1.86 
6. UNIVERSITY PROFESSOR 5.85 
2.95 8.04*** 
UNIVERSITY PROFESSOR favors EXTRA- 
MARITAL SEXUAL RELATIONS 2.90 
25 1.32 
EXTRAMARITAL SEXUAL RELATIONS 2.65 
7. CLARK GABLE 5.26 
А 2.64 9.78*** 
CLARK GABLE was in favor of FIDEL CASTRO 2.62 
26 1.37 
FIDEL CASTRO 2.36 
8. ATHLETE 5.84 5 
4.04 20.20* 
DISHONEST ATHLETE 1.80 


27 2.25* 
DISHONEST 1.53 
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TABLE 2—(Continued) 


„Mean 
M difference 1 
9. ROMANCE 6.09 
3.96 17,22*** 
UNFAITHFUL ROMANCE 2.13 
A9 2:97** 
UNFAITHFUL 1.64 
10. NIKITA KHRUSHCHEV 3.19 
1.89 CA 
KHRUSHCHEV advocates CLOSE FAMILY TIES 5.08 
1.35 5.40*** 
CLOSE FAMILY TIES 6.43 
11. RUSSIA 4.08 
1.34 4.19*** 
RUSSIA extends FREEDOM OF THE PRESS 5.42 
73 2.52* 
FREEDOM OF THE PRESS 6.15 
12. PROSTITUTE 2.33 
52 3.25* 
PROSTITUTE who looks like GRACE KELLY 2.85 
2.82 10.68*** 
5.67 


GRACE KELLY 


* p «.05. 
жр < .01. 
жұр < .001. 


case of Assertions 10 and 11 that the 
obtained evaluations are clearly pre- 
dicted by the congruity model (Table 
1). Finally, it is noted that only in 
the case of Assertion 12 is the mean 
evaluation of CS (PROSTITUTE who 
looks like GRACE KELLY) closer to S 
than to C, and here too the obtained 
mean differs significantly from the con- 
gruity model’s prediction. 

Let us now consider the extent to 
which the obtained results of CS are 
predicted by the principle of belief con- 
gruence which asserts that it is the 
relative importance of the two compo- 
nents perceived with respect to опе 
another which is the crucial determi- 


Virtually impossible for the obtained results, 
пана that they fall somewhere in be- 
Ween the component means, to differ sig- 
nificantly from the results predicted by the 
Ongruity principle. 


nant of the evaluative meaning of the 
combined configuration. To generate 
specific predictions from this theory it 
is necessary to have independent meas- 
ures of relative importance. Since we 
were not able to obtain such measures 
on the original sample of 42 subjects 
(for the simple reason that at the time 
of this study we had not yet fully de- 
veloped the model presented here) we 
obtained the required measures of rela- 
tive importance on a second comparable 
sample of introductory psychology stu- 
dents tested in the fall of 1963* We 
proceeded on the assumption, (a) that 
the average evaluative meaning of 
the components considered separately 
(shown in Table 2) would be approxi- 
mately the same in the two samples, 

т We wish to thank Jacob Jacoby for his 


invaluable help in collecting and analyzing 
these data. 
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and (b) that the relative importance of 
C and S in the context CS would also 
be approximately the same for the two 
samples. Thus, we predicted from 
measures of relative importance ob- 
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tained from one sample to measures of 
evaluative meaning of combinations ob- 
tained from another, presumably com- 
parable sample. 

The instructions * were as follows: 


This is a scientific study in the meaning of words and combinations of words. There are 
no right or wrong answers. What we want is your personal opinion, 


Consider, for example, the combination “IRRESPONSIBLE FATHER,” 


P асе 
1. How do you feel about IRRESPONSIBLE FATHER? Оп the rating scale below indicat 
with a check mark how strongly you approve or disapprove of IRRESPONSIBLE FATHER. 


| | =| | 


D 
Ch 
= 


| 
1 2 5 6 7 
Strongly Strongly 
disapprove approve 


г " ; A h 
2. In rating IRRESPONSIBLE FATHER the way you did, how important, that is, how тше 

weight did each word, IRRESPONSIBLE and FATHER, have in determining your rating of eis 

SPONSIBLE FATHER? Did you feel about the combination IRREsPONSIBLE FATHER the Sar 


way you felt about the single word IRRESPONSIBLE or the same way you felt about RATHER 
Estimate how much weight IRRESPONSIBLE and FATHER had in determining the way you act 
ally rated the combination IRRESPONSIBLE FATHER. 


а. My feelings about IRRESPONSIBLE completely (100%) determined the way I rated 
IRRESPONSIBLE FATHER, 


Now proceed to question b. 


.... Now proceed to question c. 


b. In fact, my feelings about IRRESPONSIBLE FATHER 


are г iy 
eelings about oth are even more extreme than П 
feelings about other people who are IRRESPONSIBLE, 


ч s z are 
No, my feelings about IRRESPONSIBLE FATHER and IRRESPONSIBLE а 
about of equal strength. 


SOR or Oe Yes, my feelings about IRRESPONSIBLE FATHER are even stronger than MY 
feelings about other people who are IRRESPONSIBLE. 
bit 


— How much stronger? Slightly stronger (1% stronger)? Quite ? Му 


stronger (50% stronger)? Much, much stronger (100% stronger)? 
best guess is (fill in blank at left). 


Do not answer с. Go on to the next page. 


с. My feelings about IRRESPONSIBLE did not completely determine my rating of the ER 
bination IRRESPONSIBLE FATHER, | would guess that irresponsiBLe influence 


P P nd 
me about ... percent in determining my rating of IRRESPONSIBLE FATHER 4 
FATHER influenced me about .... --. percent, 


“No” to 
estion 
Jative 


lative 


Two additional examples then fol- If the subject responded 
lowed—GOD-FEARING THIEF and TRAI- Question 2a he proceeded to Qu 
TOROUS AMERICAN—using the identical 2c, in which he estimated the re 
instructions. Then subjects were pre- fhe quantitative measurement. of € 


sented with the 12 assertions already ^ importance proved to be unusually comp 
discussed, in each case using the same cated from a technical standpoint. Tif y m 
“programed” format. The order of jects exhibited considerable di “relative 
Р $ E understanding what was meant by puer 
presentation of the 12 assertions was importance" Two preliminary studies 


randomized from subject to subject. necessary to perfect these instructions. 
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importance of C and S in percentage 
terms. However, if the subject re- 
sponded “Yes” to Question 2a he 
proceeded to Question 2b, thus per- 
mitting a determination of whether CS 
was more important than C, and if so, 
how much more important. 

The total number of subjects was 71. 
Of these 14 were eliminated because 
they obviously did not understand the 
instructions: they left one or more 
questions unanswered, or the two per- 
centage estimates given in Question 2c 
did not add up to 100%, or they pro- 
ceeded to answer Question 2b when 
they should have answered Question 
2c, or vice versa. 

Since we did not obtain semantic dif- 
ferential scores for the components for 
these subjects we assumed each sub- 
ject as having the mean component 
Scores shown in Table 2. We then 
computed a predicted score for each 
combination (CS) for each subject 
following Formulas 1 or 2, whichever 
was appropriate. It is the distribution 
of these predicted scores for CS, de- 
rived from one group, which were then 
compared with the obtained CS scores, 
derived from a comparable group. 

It is necessary to emphasize that the 
results shown in Table 1 are group 
Means. Predictions were actually 
made for each subject and for each 
assertion separately. In 38% of all 
the assertions (a total of 684 asser- 
tions) the subjects judged C to be 
100% important (and S 0% impor- 
tant) in determining the meaning of CS 
(assimilation) ; in 16% of all asser- 
tions, the subjects further judged CS 
to be more important than C (overas- 
Similation) in varying degrees. 

It will be seen that the obtained 
Means conform on the whole reason- 
ably closely to those predicted by the 
elief-congruence model. The obtained 
means differ significantly from the pre- 
dicted means in only 3 of the 12 asser- 
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tions (1, 2, and 12), and on the remain- 
ing 9 the obtained means do not differ 
significantly from the predicted means. 
In contrast, only 4 of the 12 obtained 
means differ insignificantly from the 
congruity model’s predicted means. 

A more sensitive test of the predic- 
tive power of the two models is ob- 
tained by comparing the mean absolute 
error of the two sets of predictions— 
the difference between obtained and 
predicted means. On Assertion 1, for 
example, the congruity model’s mean 
error is 1.69 while the belief-congruence 
model’s mean absolute error is .44. 
The mean error of prediction for all 12 
assertions is 1.07 for the congruity 
model but only .34 for the belief-con- 
gruence model. Thus, the belief-con- 
gruence model’s average error is only 
about one-third (.34) that of the con- 
gruity model’s (1.07).° 

One possible objection to our pro- 
cedure is that the predicted and ob- 
tained CS means were not derived 
from the same subjects. We there- 
fore also compared the predicted means 
with the means obtained from Question 
1 (see instructions) in which the same 
subjects evaluated CS on a simple 7- 
point rating scale ranging from 
“strongly disapprove” to "strongly ap- 
prove." In other words, obtained 
means were derived írom the sub- 
jects’ responses to Question 1, and pre- 
dicted means were derived from their 
responses to Question 2. 

The overall results are 
lar to those already presen 
1). The mean error of pre 


9 On the first preliminary study designed 
to measure relative importance, the mean 
error for all 12 assertions was 56 (N — 11), 

preliminary study the 


and on the second m 
mean error was - (М = 63). The final 
study, on which the mean error was 34, 


differs from the preliminary studies in one 
major respect: it included an evaluative 
rating of CS (see Question 1 on instruc- 
tions) as well as ratings of relative im- 


portance. 


highly simi- 
ted (Table 
diction for 
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Assertions 1 through 12 
tively : .34, .27, .04, .06, .77, .01, .55, 
.10, .07, .63, .53, and .82. For all 12 
assertions the mean error of prediction 
is .35. 


are, respec- 


FURTHER CONSIDERATIONS CONCERN- 
ING THE VALIDITY OF THE Two 
PRINCIPLES 


It is clear that the data presented 
here are considerably more in line with 
the belief-congruence principle than 
with the congruity principle. More- 
over, the relatively large size of the 
congruity model’s prediction errors 
would appear to be at variance with 
the results previously presented by Os- 
good and Tannenbaum (1955) and by 
Osgood, Suci, and Tannenbaum 
(1957) in support of their claims re- 
garding the congruity principle’s gen- 
eral validity. In this section of the 
paper we will try to reconcile the pres- 
ent findings and interpretations (with 
respect to the two principles) with 
those put forward by Osgood and his 
co-workers, by looking more closely at 
the various kinds of evidence they 
have presented in support of the con- 
gruity principle. It will be suggested 
that these previously published data 
provide no more support for the con- 
gruity principle than do the present 
data. 

1. Osgood, Suci, and Tannenbaum 
(1957) write concerning their word 
combination study, 

One estimate—perhaps the crudest—is how 
often the obtained factor scores for the 
combinations fall between the factor scores 
for the components, a result required by the 
congruity formula [p. 280]. 

In their attitude change study Osgood 
and Tannebaum (1955) determine how 
often 

predicted positive changes (+) and predicted 
negative changes (—) show corresponding 
signs in the obtained data [p. 51]. 

The data relevant to such a test, how- 
ever, do not provide us with any esti- 
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mate of the congruity model's accuracy 
other than to tell us that the obtained 
values have moved some amount greater 
than zero in the direction of the pre- 
dicted values—a modest result which 
could just as easily have been pre- 
dicted by a much simpler “regression- 
toward-the-mean" hypothesis. 

2. Osgood, Suci, and Tannenbaum 
(1957) state, 


Another estimate of the accuracy of predic- 
tion is the average magnitude of Чен" 
‚.. between predicted and obtained scor 
for word combinations [p. 280]. 


They explicitly indicate, after looking at 
their data, that the congruity wp 
does not reliably predict the results © ч 
tained on the evaluation factor.” bo 
is therefore no difference between us 
combination study and our's insol 
as the evaluative factor is concerne: 
neither set of data—their’s or our s~ 
is, considered as a whole, in accor 
with the congruity model. In this gon- 
nection we calculated for our evalua 
tion data the correlation between b 
size of the congruity principle's ped 
tion errors and the magnitude of t 
disparity between the components an 
we obtained a rho of +.80. This w 
gests that the greater the disparity ys 
the two interacting components the ow 
apparent the operation of the congrU'^ 
principle. 

3. Osgood, Suci, and Tannenbaum 
(1957) write concerning the Osg°? 
and Ferguson word combination study’ 


«ate 
Still another, and perhaps the best, esti 
of prediction accuracy is the correlati to 
tween predicted and obtained mean facto: 
Scores. For the evaluative tC o6; 
7 = .86; for the potency factor, "^^ 


гапша 


10 But, they add, the congruity id 
does reliably predict the results аргы 
the potency and activity factors. es 
these data it is highly likely that iiy 
parity of component scores was hig oe e 
stricted in their study (as IE ee jf not 
Present study) thus making it difficult, viate 
impossible, for the obtained values bs 
significantly from the predicted values. 


din£ 
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and for the activity factor, r=.90—all 


highly significant [p. 280]. 

In their attitude change study, Osgood 
and Tannenbaum (1955) report a cor- 
relation of .91 between predicted and 
obtained changes on the evaluation 
factor. 

There are two reasons why we are 
reluctant to accept these findings as 
evidence of prediction accuracy. First, 
it is difficult to reconcile the admit- 
tedly negative findings with respect to 
the evaluation scores discussed in Item 
2 above with the high correlations be- 
tween obtained and predicted scores on 
the evaluative factor. Second, and far 
more important, while the high corre- 
lations seem at first glance to be im- 
pressive evidence for the congruity 
model, further reflection suggests that 
the correlations may be spurious. Tf 
the upper and lower values of the two 
components vary from one combination 
to the next, and if the obtained and 
predicted values of the combinations 
fall anywhere between the upper and 
lower values, a positive correlation will 
necessarily result even if the two sets 
of values (predicted and obtained) 
deviate significantly from one another. 
For example, the upper and lower 
limits on Assertion 3 ( Table 2) is 6.52 
for сор and 5.38 for necro. Table 1 
shows that the predicted and obtained 
values for the configuration NEGRO who 
believes in cop fall in between these 
two values (6.32 and 6.42, respec- 
tively). Similarly, the upper and 
lower limits on Assertion 8 are 5.84 for 
ATHLETE and 1.53 for DISHONEST, and 
the predicted and obtained values for 
DISHONEST ATHLETE again fall in be- 
tween these values (3.36 and 1.80, re- 
spectively). When the two coordi- 
nates of Assertions 3 and 8 are plotted 
On a scatter diagram along with the 
co-ordinates for the remaining 10 as- 
Sertions a high correlation is found (r 
= .956) even though the obtained val- 
Чез differ significantly from the con- 
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gruity model’s predicted values in 8 
of the 12 assertions. In other words, 
we too get high correlations between 
obtained and predicted values even 
though our results for the most part 
deviate markedly апа significantly 
from those predicted by the congruity 
principle. It is thus seen that the cor- 
relation between the obtained and the 
predicted values cannot provide a test 
of the accuracy of prediction since the 
correlation is spuriously influenced by 
the particular upper and lower values 
of the two components entering into 
the combination, which inevitably de- 
limit the range within which the pre- 
dicted and obtained values of the com- 
bination will fall. 


ConcLUDING REMARKS 


Of all the major balance theories cur- 
rently extant, the congruity principle 
is virtually the only one which, until 
now, has attempted to make specific 
quantitative predictions regarding the 
outcome of cognitive interaction. It 
thus put itself out on a limb where it 
could more easily be proven inadequate 
or insufficient. It is to the credit of 
the authors of the congruity principle 
that they themselves have drawn at- 
tention to the lack of precision in the 
predictions generated by the congruity 
model. It is also to their credit that 
they have tried to improve the predic- 
tive efficiency of the congruity model 
by positing a number of variables which 
limit its operation. These include a 
quantitative correction for credulity, an 
assertion constant and qualitative cor- 
rections for relevance-nonrelevance, 
derogation-nonderogation, and adjec- 
tive-noun. 

Tn our opinion, all these attempts at 
improving predictive efficiency stem 
from the fact that the basic theory on 
which it is based is psychologically an 
untenable one: congruity theory at- 
tempts to predict cognitive interaction 
solely from a knowledge of measurable 
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properties of the components judged in 
isolation. It is this deficiency which 
the present formulations are designed 
to overcome. We think it fair to say, 
from all the preceding, that despite the 
various attempts by Osgood and his 
co-workers to improve the congruity 
model’s predictive efficiency, that even 
their own data do not provide support 
for the congruity principle, especially 
with regard to the evaluative factor, 
the most important factor in semantic 
differential studies and the one which is 
of major interest to attitude theory and 
research. Consequently, the problem 
of reconciling our present findings and 
interpretations, involving the principle 
of belief congruence, with those of 
Osgood’s would seem to be largely re- 
solved. The data presented here, de- 
signed to test the relative predictive 
efficiency of the congruity principle and 
the principle of belief congruence, 
would seem to be clearly more consist- 
ent with the latter principle. It re- 
mains to be seen whether this principle, 
as formulated earlier (Rokeach, 1960) 
and further elaborated here, will also 
prove helpful in studying other phe- 
nomena involving cognitive interaction, 
including attitude change." 

11 The problem of attitude change con- 
cerns the effects of cognitive interaction on 
changing the subsequent evaluative meaning 
of the components, or, those beliefs-disbeliefs 
activated by the components. While we have 
presented no data here on this issue it may 
be assumed, following Osgood and his co- 
workers, that the principle of belief congru- 
ence would predict effects on components, as 
a function of learning, similar to those pre- 
dicted for the meaning of word combinations, 


Mitton RokEACH AND GILBERT ROTHMAN 


REFERENCES 


AscH, S. E. Forming impressions of per- 
sonality. Journal of Abnormal and Social 
Psychology, 1946, 41, 258-290. 

AscH, S. E. Social psychology. New 
York: Prentice-Hall, 1952. 

Brown, R. Models of attitude change. In, 
New directions in psychology. New 
York: Holt, 1962. Pp. 1-85. ‚а 

Byrne, D., & Woxc, T. J. Racial prejudice, 
interpersonal attraction, and assumed dis- 
similarity of attitudes. Journal of Ab- 
normal and Social Psychology, 1962, 65, 
246-253. А 

Epwarps, А. L. Experimental design ЇЇ 
psychological research. New York: Holt, 
1960. 

Оѕсоор, C. E. Cognitive dynamics in human 
affairs. Public Opinion Quarterly, 1960, 
24, 341-365. 

Oscoop, C. E., Sucr, С. J, & TANNENBAUM, 
P. Н. The measurement of meaning 
Urbana: Univer. Illinois Press, 1957. 

Оѕсоор, C. E, & TaxNENnAUM, P. Н. The 
principle of congruity in the prediction O 
attitude change. Psychological Review 
1955, 62, 42-55. ч 

Кокклсн, M. The open and closed mind: 
Investigations into the nature of belief 5357 
tems and personality systems. New YOT** 
Basic Books, 1960. ; 

Roxeacu, M. The three Christs of УР" 
lanti: A psychological study. New York: 
Knopf, 1964. 

Кокклсн, M. The nature of attitudes. 1% 
International encyclopedia of social 50 
ences. New York: Macmillan, 1965, !? 
press. 

RokrACcH, M., Ѕмітн, Parercra W» & 
Evans, К. L Two kinds of prejudice 00 
one? In M. Rokeach, The open and фе 
mind. New York: Basic Books, 196 
Pp. 132-168. 

Stem, D. D., Harpycx, Janz А. & Surrm 
M. B. Race and belief: An open and a 
case. Journal of Personality and Soc 
Psychology, 1965, 1, 281-289. 


(Received February 18, 1964) 


Psychological Review 
1965, Vol. 72, No. 2, 143-155 


PERSON AND POPULATION AS PSYCHOMETRIC 
CONCEPTS' 


JANE LOEVINGER 


Social Science Institute, Washington University 


Person is the central primitive notion of psychology. Persons consti- 


tute populations, hence (or, that i 
There are no populations of tests 
no means for random sampling. 

invariably expert selection. 
assume random sampling of tests, 
the Q technique, classical reliabil 
tions, and others. 

relation either to t 
testing. Representativeness o 


tion for generalization is reaffirmed, 
representativeness is required precisely w 


able. 


The two problems that are basic to 
psychometrics and differential psychol- 
ogy are the logic of measurement and 
the factorial problem. Under what 
conditions can one be said to measure 
a psychological trait in some more or 
less rigorous meaning of measurement? 
And, what are the basic dimensions of 
individual differences among people? 
Thus, how can we measure? And, 
what is worth measuring? These two 
questions are the framework of this 
essay. 

My thesis is that person and test are 
radically different kinds of concepts 
and that confusion or exchange of them 
leads at best to ad hoc pseudomeasure- 
ments and at worst to error. Person is 
a primitive, undefinable term (Straw- 
son, 1958); surely it is the central 
primitive notion of psychology, though 
rarely acknowledged as such. Popula- 
tion is a collective form, and so is 

1 Preparation of this paper was supported 
by Grant MH-05115 from the National Insti- 
tute of Mental Health, United States Public 
Health Service. It is a revision of the 

residential address to Division 5 of the 
qu erican Psychological Association, Phila- 
elphia, September 2, 1963. 


These techniques seem 
he science of psychology or to the practice of 
{ experimental conditions as the founda- 


s to say), can be randomly sampled. 


, items, or testing conditions, hence, 


Sampling in those realms is almost 


Psychometric methods whose derivations 


items, or testing conditions include 
ity theory and its recent liberaliza- 
inadequately anchored in 


however. Expert attention to 
hen randomness is unattain- 


sample, though less obviously so etymo- 
logically. Test, on the other hand, 
refers to an instrument or artifact of 
our science and contains an implication 
of measurement or indexing. Our 
samples comprise persons; tests are 
analyzed statistically, e.g., they are cor- 
related and factored. Statistical or 
psychometric methods that purport to 
sample tests or items seem to violate 
these most basic notions of psycho- 
logical statistics. But such a position 
stands in opposition to parts of the 
work of such leading psychometricians 
as Burt (1937); Cattell (1952) ; 
Stephenson (1936, 1952); Guttman 
(1944, 1950) ; Coombs (1952) ; Block 
(1961) ; Lord (1955) ; Tryon (1957) ; 
and Cronbach, Rajaratnam, and Gleser 
(1963). | ' 
Often tied to sampling of items 1s 
correlation of persons, i.e., correlation 
of a set of scores generated by or per- 
taining to one person with other cor- 
responding sets of scores. Correlation 
of persons also may arise in other con- 
texts. No attempt will be made here 
to list or to evaluate all such pos- 


sibilities. 
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Some authors, such as Coombs, have 
stressed an abstract equivalence, hence 
interchangeability, of person and test. 
Others, such as Guttman, have used 
an imprecise concept of a “universe of 
attributes,” from which the content of 
particular tests has been derived by a 
sampling process, also somewhat im- 
precise, though explicitly not a random 
process (Guttman, 1950b, p. 54). The 
imprecision has been criticized (Camp- 
bell & Kerckhoff, 1957; Loevinger, 
1955) in a manner anticipating the 
present article. Recently a number of 
psychometricians, particularly Lord 
(1955 and elsewhere) and Cronbach, 
Rajaratnam, and Gleser (1963 and 
elsewhere), have rendered precise the 
notion of sampling items and tests and 
capitalized on it to derive psychometric 
formulas from statistical theory. In 
doing so, they have also opened the 
way for clearer and more precise evalu- 
ation of the notion of random sampling 
of items and tests. 


TEST VERSUS PERSON 


Every schoolboy knows what a test 
is. He might be surprised to know 
that the first, oldest meaning given by 
the dictionary is 


a cupel or cupeling hearth 


1 for refining 
precious metals, 


and the next, 


examination or trial by the cupel; hence, 
any critical examination or decisive trial. 


The educational use corresponds to that 
of the schoolboy : 

Any series of questions or exercises or other 
means of measuring the skill, knowledge, 
intelligence, capacities, or aptitudes of an 
individual or group. 

The meaning that seems closest to the 
general meaning of the term in psy- 
chology is the following: 

Means of trial; specif., subjection to condi- 


tions that show the real character of person 
or thing in a certain particular; as, the 
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tuberculin fest for tuberculosis [Webster's 
New Collegiate Dictionary, 1961, p. 878]. 

The schoolboy meaning, in which 
test stands for a set of items, differs 
from that of psychometricians, who use 
the term rather for the items, adminis- 
tration, scoring key, and norms. 
Plagiarism of each other's items tends 
to be condoned by psychometricians 
because of the importance of norms, 
which can hardly be plagiarized. 
These connotations of the word test 
are not arbitrary, but reflect the reali- 
ties of the situation; the score of а 
person by no means depends on the 
items alone, 

In regard to the concept of person, 
on the other hand, every schoolboy 
does know what the word means, 10 
the same sense that psychometricians 
use it. No one could be enrolled in 
kindergarten who did not have the 
concept of person as a deeply ingraine 
Part of his mentality, who did not un- 
derstand and recognize the identity of 
à person from day to day despite super" 
ficial changes in appearance, and who 
did not know the difference between 
encountering one person and encoun- 
tering two persons, A person retains 
his identity through the extreme 
changes from infancy to old age. T” 
contrast with a person, a test does not 
have an identity through time despite 
superficial changes in administration 
or norms, nor is it always clear whether 
one is confronted with two copies of а 
single test or with two different tests: 

Seemingly minor differences in test 
administration may greatly affect рег" 
formance on many tests; hence, the us¢ 
of norms set for one administrative 
circumstance is inappropriate if the eet 
ministration changes, say, if time lim > 
are altered. Since it requires differ 
ent norms, it is not the same test. 
test may change even though ther r 
no change in items, administration, 0 
norms. This fact is recognized ап 


e i$ 
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dealt with by periodic revisions, in 
some cases by yearly revisions. 

It has been argued (Loevinger, 
1957) that test and retest are best con- 
sidered as two different tests except 
where they are shown to be closely 
similar with respect to mean, variance, 
and correlation with outside criteria. 
In the usual case, where secular trends 
exist, i.e., where test and retest have an 
appreciable mean difference, different 
norms are mandatory ; hence, by defini- 
tion, test and retest are different tests. 
The argument here is slightly different : 
it is ambiguous whether test and retest 
are the same or two different tests. 
The ambiguity over identity differen- 
tiates the concept of test from that of 
person. 

There are, thus, drastic differences 
between person as a construct and test 
as a construct. A person retains recog- 
nizable identity through slight super- 
ficial changes, while a test may or may 
not. We are always clear whether we 
are confronted by one person or by 
two people. People do not, in front of 
one, shade off into one another im- 
perceptibly. Tests, on the other hand, 
may differ in minor ways whose signi- 
ficance is doubtful. Are two forms of 
the same test a single test or two 
different ones? Considering the limit- 
ing form of a test, a single item, how 
great does the change in wording have 
to be before it becomes a new item? 
Some changes in wording may be 
Ignored by most people (e.g. a small 
misspelling or a grammatical error), 
hence be negligible changes. Others 
Тау greatly affect the difficulty or 
Popularity of the item. In general, one 
Cannot predict in advance, but must 
try out both forms. 

, Some psychologists who have prac- 
ticed or advocated correlation of per- 
Sons or sampling of tests or testing 
onditions have acknowledged argu- 
Ments like those presented here; others 
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have ignored the issues. Burt (1937, 
and in earlier writings) probably was 
the first to exploit systematically the 
correlations between persons, though 
he credits W. Stern with the original 
proposal. Thinking primarily of per- 
sons serving as rating instruments, 
Burt makes the point that what is in 
one context a person is in another 
context a test. Hammond (1955) has 
observed that this situation is typical 
of much clinical research, Burt 
(1937) goes on: 

We can safely correlate Persons only when 
the traits or tests fulfill much the same 
conditions that we should require of the 
persons when correlating fests: that is to 
say, as a general rule: (i) the tests must be 
sufficiently numerous to keep the probable 
errors low; (ii) they must form a represen- 
tative or random sample of the total popula- 
tion of tests; (iii) the marks or measure- 
ments obtained from different tests must all 
be reducible to terms of the same unit; (iv) 
the distribution of the measurements should 
be, as nearly as possible, normal; and (v) 
the correlation should be linear. Otherwise 
a correlation between persons is likely to be 
meaningless [p. 68]. 


To the caveat of Burt must be added 
the following, from Cattell (1952) : 
The test population, except for the person- 
ality sphere concept, does mot have the 
qualities of a biological species population. 
The experimenter, indeed, usually does not 
even make any attempt to sample the test 


universe [p. 510]. 


Cattell's case for the personality sphere 
having the character of a biological 
species population rests on the claim 
that personality traits can be counted 
on to be represented in language, hence 
a list of trait names covers the per- 
sonality domain (Cattell, 1946). 

To Burt’s caution and Cattell’s stric- 
ture may be appended some remarks 
of Spearman (1927), made in slightly 
different context. He argued against 
the definition of intelligence as either 
an average or a sample of functions 
on the grounds that to compute an 
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average or to draw a sample certain 
basic postulates must be met that are 
not met in intelligence tests: unequivo- 
cal domain, comparable cases, no repe- 
tition, and no omission. The limits of 
the domain over which sampling or 
averaging takes place must be defined 
unequivocally, a single case or instance 
must be in some respect equivalent to 
every other case or instance, and the 
domain must be defined so as neither to 
omit nor to repeat instances. 


The worst about this theory of sampling, 
however, has yet to be mentioned. It is 
that, in actual practice, the procuring of a 
genuine sample has not really even been 
attempted. One of the many indications of 
this is the prevalent procedure, in the con- 
struction of a series of tests, of trying out 
a large number and then selecting those 
which exhibit the highest correlations with 
all the rest. Such a procedure seems to 
have been more or less influential, directly 
or indirectly, in the framing of all generally 
accredited series at the present day. ... 
In order to obtain a genuine sample, one 
carefully retains, and even adds to, the 
sorts which were at first little represented, 
and which therefore tend to be least corre- 
lated with, the remainder [p. 70]. 


As we shall see, Spearman was de- 
Scribing a kind of representative, not 
random, sampling. Returning to Cat- 
tels assertion that the personality 
Sphere constitutes a population, one 
sees that his argument rests entirely on 
fulfilling the postulate of no omission. 
He makes no claim of unequivocal 
domain, of comparable cases, or of no 
repetition. 


POPULATION AND SAMPLE 


In examining the basic constructs of 
psychometrics more deeply, let us begin 
where statistics began, with the concept 
of population. What is usually con- 
sidered the first statistical tract was 
"Natural and Political Observations 
Made upon the Bills of Mortality" by 
Graunt, published in 1662 (Graunt, 
1956). Graunt noted that parish 
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clerks of London published weekly an 
account of deaths, listed by causes, and 
also of christenings. He collated these 
for the years 1604—61 and showed how 
inferences could be drawn from the 
data. Ages of the casualties were not 
given in the reports, but Graunt made 
some estimates of age from the causes 
of death. The number dying of acute 
diseases he took as a measure of the 
tendency of the climate and air to sud- 
den unfavorable changes; the number 
dying of chronic diseases was a meas- 
ure of the fitness of a country for 
longevity, etc. 

The first life insurance tables were 
published in 1693 by Sir Edmund 
Halley. He used tables of births and 
funerals in the city of Breslaw as à 
source for mortality tables from which 
he calculated the worth of annuities 
taken out at various ages. For this 
purpose, he noted, the bills of mortality 
of London and similar data from Dub- 
lin were not adequate: 

First, In that the Number of the People 
was wanting. Secondly, That the Ages © 
the People dying was not to be had. Ап 
Lastly, That both London and Dublin bY 
reason of the great and casual Accession О 
Strangers who die therein, (as appeared in 
both, by the great Excess of the Funerals 
above the Births) rendred them incapable 
of being Standards for this purpose; which 
requires, if it were possible, that the People 
we treat of should not at all be changed, but 
die where they were born, without any 
Adventitious Increase from Abroad, ог De- 


cay by Migration elsewhere [Halley, 1956, 
p. 1437]. 


Here is the statistical notion of popu- 
lation in statu nascendi. Population 
meant people, and at that time only 
persons. Obviously a somewhat 
broader usage prevails today. Tippet 
(1943) states: 


T shall refer only to populations a, 
of recognizably discrete individuals, €" 
men or electric lamps [p. 79]. 

i 5 
In his usage only denumerable classe 
of objects constitute populations. 
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In current statistical usage most of 
the populations referred to are hypo- 
thetical ones; usage has shifted to re- 
ferring to the measurements as the 
population rather than the objects to 
which the measurements pertain, but 
the latter point is not germane to the 
present discussion. Wallis and Rob- 
erts (1962) state: 


The two most fundamental concepts of sta- 
tistics are those of a sample and a population. 
A sample is often referred to as “the data” 
or “the observations”: numbers that have 
been observed. The population, on the other 
hand, is the totality of all possible observa- 
tions of the same kind [pp. 124-125]. 


And again: 


Population is an abstract concept funda- 
mental to statistics. It refers to the totality 
of numbers that would result from in- 
definitely many repetitions of the same 
process of selecting objects, measuring or 
classifying them, and recording the results. 
A population is, thus, a fixed body of num- 
bers, and it is this general body of numbers 
about which we would like to know [pp. 
139-140]. 


In current statistical usage the no- 
tion of population has been greatly 
generalized from its popular and origi- 
nal meaning. In regard to the concept 
of sample, however, the statistical usage 
is derived from ordinary usage not by 
generalization but by a narrowing of 
meaning. Thus a dictionary definition 
of sample is: 


A part of anything presented for inspection, 
Or shown as evidence of the quality of the 
whole; a specimen [Websters New Col- 
legiate Dictionary, 1961, p. 748]. 


But statistical inference can give in- 
formation only about random samples. 
A section of the Wallis and Roberts 
book (pp. 143-146) is devoted to 
“Randomness vs. Expert Selection.” 
They do not disparage experts or ex- 
pert judgment but emphasize that sta- 
tistical methods are applicable only to 
random not to expert selection. 
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SAMPLING OF TESTS AND ITEMS 


Have any psychological tests been 
marketed whose items are chosen by a 
random sampling method? Is it not 
rather the case that all serious test 
construction projects involve expert 
selection? Here is an enormous dis- 
crepancy. In one building are the test 
and subject matter experts doing the 
best they can to make the best possible 
tests, while in a building across the 
street the psychometric theoreticians 
construct test theories on the assump- 
tion that items are chosen by random 
sampling. 

The objection to all psychometric 
developments that assume random sam- 
pling of items or tests is in the first 
instance that they grossly misrepre- 
sent the actual case, which is almost 
invariably expert selection rather than 
random sampling. But there is also 
implied in the argument a subtler and 
deeper point. The term population 
implies that in principle one can cata- 
log, or display, or index all possible 
members, even though the population 
is infinite and the catalogue cannot be 
completed. Statistical sampling must 
be tied to such a display and indexing 
system, else it cannot be random. 
Tests and items are not that sort of 
thing. There is no meaning to talking 
about populations of tests or items. 
No system is conceivable by which an 
index of all possible tests could be 
drawn up; there is no generating prin- 
ciple. Any definition of the terms 
population and sample, however ab- 
stract, that retains those elements of 
meaning that authorize the use of sta- 
tistical theory will exclude tests and 


test items. 
Q-Sort TECHNIQUE 


Another group of psychometric us- 
ages that violate fundamental meanings 
includes Q technique and Q sort. Take 
as a single and particularly meritorious 
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example Block’s (1961) CQ set. This 
is a set of 100 descriptive phrases by 
means of which professional persons 
make a standardized description of a 
person, patient, class of persons, etc. 
The sorter arranges the statements in 
a given quasi-normal distribution. 
Ordinal numbers are assigned to the 
successive, arbitrary intervals. Each 
person rated has 100 scores, ranging 
in value from 1 to 9, one for each state- 
ment sorted. The 100 items are eclec- 
tic, in some cases semitechnical, but 
theoretically noncommittal. As a more 
or less standardized means for render- 
ing descriptions of patients and other 
persons mutually comparable, the CQ 
set has obvious merit. 

The two fundamental problems of 
psychometrics and differential psychol- 
ogy are evaded, however. The number 
assigned to each item for each person 
rated comes from an arbitrary distri- 
bution. The question of how nonarbi- 
trary numbers, i.e., rigorous measures, 
can be constructed is not only not 
solved, it is foreclosed from view. The 
method may work in a rough and 
ready way, but it cannot contribute to 
understanding the fundamental nature 
of psychological measurement, 

The other basic problem, the fac- 
torial problem, is also in part evaded. 
The items are not advanced on the 
basis of, nor do they contribute to, any 
fundamental insights into the structure 
of human personality. How then can 
the method work even as an ad hoc 
descriptive instrument? Adequate cov- 
erage of the domain of individual dif- 
ferences was obtained not by sampling 
a population of items, for no such 
population exists, but by an unac- 
knowledged and informal sampling of 
persons. Block reports that his set of 
items was modified more than once 

after use and criticism by a number of 
professional colleagues of varying in- 
terests. There are two instances of 
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sampling here. First there is a sample 
of colleagues; then there is the second 
order sample, consisting of patients, 
friends, and relations of each colleague. 
Each professional person presumably 
asked himself, in effect, can I describe 
the distinctive features of the patients 
and other people I know in terms of 
these items? Tf not, what further items 
are necessary to make the list com- 
plete? Thus, the merits of the CQ set 
as a sample of traits derive from an 
informal sampling of people. In this 
case the nonrandomness of the sam- 
pling is of little consequence since 
what is at stake is something more like 
а range than a central tendency. 


RELIABILITY AND 175 REVISIONS 


Cronbach, Rajaratnam, and Gleser 
(1963) have reworked test theory 2D 
terms of generalizability. In adminis- 
tering a test one is not interested in the 
performance on that test per se, but 1n 
generalizing to something else. All 
kinds of reliability and validity can be 
construed as species of generalization, 
and they have done a service in urging 
care in the design and interpretation of 
reliability studies to support the appro- 
priate generalizations. But they ate 
led to base their formulations not only 
on random sampling of persons but 
also on random sampling of items ап 
of testing conditions. The same con- 
siderations hold with regard to test 
conditions as with regard to items- 
Everyone who has a serious investment 
in applied testing seeks optimal and not 
random testing conditions. There аге 
unknown and uncontrollable aspects © 
the test situation, but one works alway 
towards optimization not randomiZa- 
tion, а 

The question of how to generalize 
to performances other than those pea 
pled and to circumstances other pe 
those of the test situation is not trivi4* 
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What is meant by the best items and 
the optimal testing conditions is, in part 
at least, those that best mediate the 
most predictions. 

Cronbach, Rajaratnam, and Gleser 
present random sampling of items as a 
less restrictive alternative to the as- 
sumption of the equivalence of parallel 
forms in classical reliability theory. 
But the assumption of parallel tests 
must be based either on random sam- 
pling of a hypothetical population of 
parallel tests or else on random sam- 
pling within strata of items; in some 
formulations each item constitutes a 
stratum. Thus, if random sampling of 
tests and items is an untenable assump- 
tion, classical theory of reliability as 
well as the revisions of Lord (1955), 
Tryon (1957), and Cronbach, Raja- 
ratnam, and Gleser (1963) must be 
basically modified. 

Lord (1955 and elsewhere) uses the 
assumption of random sampling of 
items without examination of its impli- 
cations as an assumption. Tryon 
(1957) presents an interesting his- 
torical review of the assumptions used 
in classical derivations of reliability 
formulas. He shows that two assump- 
tions have been used, one based on 
division of scores into hypothetical true 
scores and error factors, which he calls 
the Spearman-Yule assumption, and 
the other that all items have equal 
standard deviations and equal intercor- 
relations, which he calls the Brown- 
Kelley assumption. Most authors have 
used one or the other, a few have used 
both. Neither, however, is necessary. 
He defines the reliability coefficient as 
the correlation between a test and a 
hypothetical comparable test composed 
of the same number of items, having 
the same average item variability and 
the same interitem correlation. Clearly 
implied is that the items are from the 
same behavior domain, since the other 
conditions could be satisfied by totally 
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uncorrelated tests. In the simplest 
case, where the items of a test form an 
unstratified composite, the items “are 
as if drawn at random from a large 
pool” of items (Tryon, 1957, p. 233). 
From this it follows that the mean 
covariance of items in the two forms 
will equal the mean covariance of items 
within each form, which is crucial to 
Tryon’s derivations. The assumption 
of parallel tests, which Cronbach, Raja- 
ratnam, and Gleser are concerned to 
replace, is a direct consequence of the 
Brown-Kelley assumption, or, if one 
begins with the Spearman-Yule as- 
sumption, is needed to bring the esti- 
mates of true score and error factor 
within reach. 

In the present view, then, the at- 
tempts by Tryon and by Cronbach, 
Rajaratnam, and Gleser to derive re- 
liability formulas on less restrictive 
assumptions than have been used in 
the past have not been fundamental 
enough. The modified assumption still 
involves sampling of items. A pre- 
vious criticism of classical reliability, 
that reliability cannot be defined in a 
noncircular manner (Loevinger, 1947, 
1957), is another, and perhaps more 
searching, way of looking at the same 
basic problem. To show how the pres- 
ent considerations can be related to 
the critique of reliability in terms of 
circularity, consider Tryon's system. 
Tryon assumes that tests comprise 
items randomly drawn from a single 
"behavior domain.” How then does 
one establish, as the test constructor 
must, that two items are drawn from 
the same behavior domain? There can 
hardly be any answer other than the 
one usually given in textbooks: the 
items within a single domain are inter- 
correlated to the extent of their re- 
liabilities. Here reliability is used to 
define the behavior domain, just as the 
behavior domain construct is used to 


define reliability. 
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Bnuxswik's PRINCIPLE or REPRE- 
SENTATIVE DESIGN 


Brunswik (1947) advocated substi- 
tuting representative for systematic de- 
sign of psychological experiments, He 
emphasized that before making psy- 
chological generalizations one must 
consider how far the conditions of the 
supportive experiments represent the 
organism's natural ecology. 


Representative sampling is extended from the 
subjects to the objects, from the individuals 
to the stimulus situations and tests [Bruns- 
wik, 1947 p. 56]. 


At first his position seems to be directly 
contrary to that of the present paper. 
However, he did not consider precisely 
the present point, ie, the difference 
between expert selection of a repre- 
sentative sample and truly random 
sampling. Statistical sampling theory 
applies only to random sampling, but 
a judgment of representativeness un- 
derlies all probabilistic generalization. 
The question of whether something like 
random sampling can be assumed to 
have occurred must be judged sepa- 
rately in every different context, 

In commenting on studies of social 
perception where people serve both as 
Subjects (in effect, as tests) and as 
objects of study, Brunswik noted that 
the subjects are usually numerous, and 
a correlative attention is paid to the 
Tepresentativeness of the sampling. 
Concerning the objects of social per- 
ception, he showed that in many studies 
they were too few, and that little atten- 
tion had been paid to their representa- 
tiveness of any population. In clinical 
research the situation noted by Bruns- 
wik is most often reversed : the persons 
studied may be numerous and chosen 
with some care, whereas the clinicians 
serving as measuring instruments are 
usually few and unrepresentative of 
any definable class of clinicians or peo- 
ple. Brunswik’s plea, that research 


Workers in social perception attend to 
the representativeness of persons serv- 
ing as objects, needs to be turned 
around in clinical research. To pro- 
tect the generality of the findings more 
attention should be paid to the repre- 
Sentativeness of people serving as 
subjects. 

The more one pursues this line of 
thought, the more arbitrary it seems 
what one calls object and what one 
calls subject; and the more one comes 
to Burt’s and, later, Hammond’s point, 
that what is from one view a person 
is from another view a test. Ap- 
parently, whatever the role of persons 
in the experimental design, whether as 
subject or object, notions of random 
sampling are appropriate in that di- 
mension, and probably a serious at- 
tempt at something like random 
sampling should occur far more fre- 
quently than it now does. 

Several authors have raised the 
problem of generalizing over testing 
conditions, and this again is reminis- 
cent of Brunswik’s discussion of the 
Narrowness of the environmental ume 
ple that enters most psychologica 
Studies. It is difficult to think of 105 
stances where the principle of ecologi- 
cal representativeness would be of value 
in relation to the conditions of testing. 
one essentially always seeks optima 
testing conditions, 

In relation to the content of psycho- 
logical tests, greater representation O 
the variety of situations of everyday 
life in place of the often artificial d 
Striction of subject matter seems a 
needed. Rarely, however, will there 5 
possible anything like a random вал 
pling of life situations. The importance 
of expert attention to the representa 
tiveness of the content is emphasize 
rather than lessened by abandoning 
random sampling model. t 

Goodenough (1949) pointed out m 
tests may be construed as signs ОГ 
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samples of behavior; Loevinger (1957) 
later expanded the point, showing that 
tests are always and necessarily sam- 
ples of behavior as well as signs of non- 
test behavior. But to say that they are 
samples is not to say that they are 
random samples; moreover, they are 
always evaluated in terms of how well 
they serve as signs of nontest behavior. 
Brunswik’s term, representativeness, 
is ambiguous here; it could be taken to 
mean randomness, or, perhaps more 
logically, sign value. Any vagueness 
or ambiguity obscures the point of the 
present essay. 


Randomness is a property not of an indi- 
vidual sample but of the process of sampling. 
+». A sample of size n is said to be a 
random sample if it was obtained by a 
Process which gave each possible combina- 
tion of s items in the population the same 
Chance of being the sample actually drawn. 
: » » Nonstatisticians usually assume that the 
importance of randomness arises from the 
"fairness" and lack of bias with which such 
Samples represent the population. This is 
important, of course, but of more importance 
is the fact that the pattern of sampling 
variability for any population is known if, 
but only if, the sampling is random [Wallis 
& Roberts, 1962, pp. 141-142]. 


Brunswik’s point, the necessity of 
Tepresentative design as a foundation 
for generalization, a point elaborated in 
relation to a number of significant psy- 
chometric problems by Cronbach, Raja- 
Tatnam, and Gleser, neither requires 
Nor justifies assumption of random 
sampling. 


Rascu’s PSYCHOMETRICS 


Rasch (1960) has devised a truly 
new approach to psychometric prob- 
lems, although one of somewhat limited 
applicability. He makes use of none 
of classical psychometrics, but rather 
applies algebra anew to a probabilistic 
model. The probability that a person 
will answer an item correctly is as- 
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sumed to be the product of an ability 
parameter pertaining only to the person 
and a difficulty parameter pertaining 
only to the item. Beyond specifying 
one person as the standard of ability 
and one item as the standard of diffi- 
culty, the ability assigned to an individ- 
ual is independent of that of other 
members of the group and of the par- 
ticular items with which he is tested ; 
similarly for the item difficulty. The 
model of course is not universally ap- 
plicable. Indeed, these two properties 
were once suggested as criteria for 
absolute scaling (Loevinger, 1947) ; at 
that time proposed schemes for abso- 
lute scaling had not been shown to 
satisfy the criteria, nor does Guttman 
scaling do so. Thus, Rasch must be 
credited with an outstanding contribu- 
tion to one of the two central psycho- 
metric problems, the achievement of 
nonarbitrary measures. 

Note that Rasch is concerned with a 
different and more rigorous kind of 
generalization than Cronbach, Rajarat- 
nam, and Gleser. When his model fits, 
the results are independent of the sam- 
ple of persons and of the particular 
items within some broad limits. With- 
in these limits, generality is, one might 
say, complete. Of course, one only 
studies particular items and particular 
people, and the problem of generalizing 
comes up again in setting the limits 
within which the model holds. The 
people can be considered a random 
sample from some population, but the 
items cannot. Rasch suggests that the 
model be turned around and be used to 
select items that fit it. If this is done, 
the goodness of fit must be evaluated 
with a new sample of people. The 
question of what traits the given items 
yield information about is part of the 
factorial problem; in relation to this 
problem Rasch’s current methods have 
not enlarged our understanding. 


152 JANE LoEviNGER 


PERSON AS A PRIMITIVE NOTION 


Strawson (1958, 1959) has ex- 
amined philosophically how man con- 
structs and apprehends his universe. 
Our frame of reference is a spatio- 
temporal one; within this framework 
material bodies are the basic particu- 
lars, ie., the particulars that can be 
identified, both distinguished and re- 
identified, without reference to any 
other kind of particulars. 


Private experiences have oíten been the 
most favoured candidates for the status of 
“basic” particulars; on the present criteria, 
they are the most obviously inadmissible 
[Strawson, 1959, p. 41]. 


The category of persons is basic in 
a different but related way. 


The admission of this category as primitive 
and underived [is] a necessary condition of 
our membership of a non-solipsistic world 
[Strawson, 1959, p. 246]. 


In acquiring the idea of a person, we 
learn to assign certain predicates to 
people, to ourselves at the same time as 
to others. In particular, the meaning 
of the predicate to be depressed is the 
conjunction of the behaviors by which 
we assign it to others and the feelings 
by which we assign it to ourselves. 


The dictionaries do not give two sets of 
meanings for every expression which de- 
Scribes a state of consciousness: а first- 
person meaning and a second-and-third-per- 
son meaning [Strawson, 1959, p. 99]. 


On purely logical grounds Strawson 
has thus arrived at a position similar 
to that which Hebb (1960) supports 
on psychological grounds, i.e., that we 
learn about ourselves to a considerable 
extent the same way and at the same 
time that we learn about others. The 
position is consonant with the develop- 
mental findings of Piaget. 

Let us take off from Strawson's 
position as a starting point. Tests, 
since they are processes rather than 


material bodies, are certainly not basic 
particulars, as persons are. Tests are 
rather a means of assigning predicates 
to persons; they are "conditions that 
show the real character of a person 

. in a certain particular." Tests are 
instruments of psychologists and as 
such are arbitrary. Presumably, how- 
ever, there are nonarbitrary long-last- 
ing differences among persons to which 
the relations among tests provide clues. 
That is precisely the factorial problem. 
The factorial problem has been a focal 
one for many of our greatest psycho- 
metricians—Spearman, T. L. Kelley, 
and Thurstone. А 

The factorial problem in its classic 
form springs directly from the concepts 
of person and test in their receive 
meanings. Proposals to sample items 
or tests and some of the schemes tO 
subject persons to statistical analys!$ 
are made in defiance of accepted usage 
How can so many able psychome 
tricians be so wrong? Or to state the 
matter noncommittally, though I a? 
not uncommitted, how can trained а 
ple differ by so much as I differ fro” 
what seems to be a majority of my 
colleagues? An answer comes from 
Polanyi, who, prior to becoming 4 
social scientist, was a physician am 
later one of the leading physical PERS 
ists of the world; he thus brings 
social science an intimate knowledge е 
the very arts and sciences that be 
scientifically rigorous among us oe) 
desire to emulate. Polanyi (19 d 
decries the aspiration of many psy "£^ 
Ogists and other social scientists ci- 
construct a completely objective dis a 
pline, to renounce any commitment 2° 
an unscientific embarrassment, ап dif- 
depersonalize their science. The an 
ferences among psychometricians € 
be described in his terms. 


a: 
I am committed to the meaning 


+ a ap 
Person as it occurs in psychology 7... 
in everyday life; to the meaning © 
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as it is used, e.g., by the American 
Psychological Association Committee 
on Tests; and to the meanings of sam- 
ple and population as they are used in 
statistics. Those who use methods 
exchanging persons and tests seem to 
me to be saying: “It doesn’t matter 
what we call a person and what we call 
a test. The formula simply calls for 
a set of commensurate scores. If we 
are completely objective, what is the 
difference whether it is the score of 
several people on one test or of one 
person on several tests? How do we 
know whether we are correlating per- 
sons or tests? How do we even know 
whether the person has failed the test 
or the test has failed the person? If 
we are objective, how do we know 
whether we are sampling persons or 
sampling items?” 

For myself, I know. My position is 
both more objective, and less objective 
than theirs: more objective in the sense 
that I believe that there are objective 
realities independent of my mind that 
exist out there and that I as a scientist 
essay to comprehend ; less objective in 
the sense that I acknowledge a personal 
commitment to what is meaningful and 
important. I am not indifferent to 
Whether statistics. are applied in a 
meaningful or meaningless way. On 
the basis of my commitments I know 
that it is meaningful to sample persons 
and not to sample tests, that persons 
pass or fail tests, but tests do not pass 
or fail persons; i.e., persons, not tests, 
make up populations, and that persons 
are variables only insofar as they serve 
as tests, 

Polanyi argues that a scientist who 
acknowledges his personal commitment 
has taken a self-consistent position, 
while to deny one's commitment is to 
take an intrinsically contradictory posi- 
tion. In what follows his arguments 
Will be applied to psychometrics, which 
he did not consider. In saying it does 
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not matter whether we enter tests 
where the problem calls for persons and 
persons for tests, the psychometrician 
is saying, in effect, "I am completely 
objective, hence uncommitted to the 
received meaning of the words person 
and test." But having taken this stand 
on the strength of noncommitment, he 
is then committed to those usages (for 
me, distorted ones) of statistics that 
result from exchange of persons and 
tests. For ultimately no scientist can 
work and remain uncommitted. The 
moment he enters his laboratory or sits 
down to his typewriter he is making a 
commitment. Each of us faces a short 
tenure and a vast universe. What we 
choose to work on we are declaring 
thereby meaningful and significant, in 
fact the most significant thing that we 
can do, 

Polanyi (1958) summarizes his 
“fiduciary programme" in the state- 
ment, 

I believe that in spite of the hazards in- 


volved, I am called upon to search for the 
truth and state my findings [p. 299]. 


This is not a subjective or solipistic 
position, It is an affirmation of an 
independently existing and meaningful 
reality. 

To be sure, science has often ad- 
vanced by breaking out of old meanings 
and opening new possibilities, as Ein- 
stein did when he redefined simul- 
taneity. Obviously, population has 
also been progressively redefined. At 
first it meant a class of people; then 
a class of denumerable objects. Per- 
haps what it now means is a class of 
objects or events, usually hypothetical, 
that can be randomly sampled (“the 
totality of numbers that would result 
from indefinitely many repetitions of 
the same process of selecting objects”). 
Thus the burden of the definition is 
shifted to the term random. 


PATTERN RECOGNITION 


Randomness is, like person, almost 
or quite undefinable, but recognizable 
by what Polanyi calls connoisseurship. 
ТА any essay “The Logic of Biology” 
Grene (1961), following Polanyi, 
shows that recognition of pattern is 
the basis of all scientific activity, es- 
pecially scientific theories, She goes 
on, in biology there is a refinement of 
the ability to detect patterns that forms 
the basis for recognition of species. 
No entirely objective criterion has been 
evolved to supplant the naturalist’s 
ability to recognize species, usually in- 
stantaneously. Similarly, in everyday 
life we instantly recognize individual 
persons whom we know. Some larger 
animals are also recognized as indi- 
viduals. Тһе identification of persons 
as a form of pattern recognition and 
the identification of randomness as ab- 
sence of pattern are thus instances of 
an indispensable but often unacknowl- 
edged ability of a Scientist. A similar 
argument is made by Lorenz (1959), 

The favorite Program of the ultra- 
objective psychometrician, the com- 
puter search for patterns, is а carica- 
ture of the pattern-search of a great 
scientist. No computer could have 
found the pattern expressed in New- 
ton’s theory of universal gravitation, in 
the theory of relativity, in thermody- 
namics, or in quantum theory. 

Does random sampling of tests, 
items, and testing conditions represent 
an illuminating new insight, or is it a 
meaningless gimmick, a quantitative 
trick without profound meaning for 
psychology and psychometrics? When 
such a question falls within his domain, 
every scientist has the responsibility, 
whether he acknowledges it or not, to 
apply his own pattern-finding ability 
to it. Crucial points are the difference 
between random and expert selection 
and the problem of continuing identity 


ЈАМЕ LOEVINGER 


in sampled objects. There is little hope 
of touching those who are deeply com- 
mitted to the idea that to be scientific 
means being completely objective, Uu 
committed, detached, and approaching 
all issues in a depersonalized manner. 
Others who have used the methods 
here criticized will be reluctant to wal 
brace the implications of these methods. 
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THEORETICAL NOTES 


SOME METHODOLOGICAL PROBLEMS IN CATTELL'S 
MULTIPLE ABSTRACT VARIANCE ANALYSIS! 


JOHN C. LOEHLIN 


University of Nebraska 


Certain methodological problems in Cattell’s Multiple Abstract Vari- 
ance Analysis (MAVA) technique for the analysis of trait variance into 
hereditary and environmental components are considered. ‘These in- 
clude inconsistencies between and within some of the equations in the 
model and an apparent anomaly in correcting for errors of measurement. 
А new approach to the derivation of MAVA equations is developed, and 
implications for previously published MAVA results briefly discussed. 


Over a number of years, Cattell and 
his associates (Cattell, 1960, 1963; Cat- 
tell, Blewett, & Beloff, 1955; Cattell, 
Kristy, & Stice, 1957) have been develop- 
ing a method for human population 
genetics known as Multiple Abstract 
Variance Analysis (MAVA). This method 
breaks down observed trait variance into 
various components representing the con- 
tributions of heredity and environment 
to variation in the trait. 

Essentially, the MAVA procedure is to 
test various groups of subjects (twins, 
siblings, etc.) reared together or apart 
and to infer from observed trait vari- 
ances in these groups certain theoretical 
hereditary and environmental variances 
and covariances. This is done by writing 
sets of equations, in each of which an 
observed variance is expressed as a com- 
bination of theoretical variances and 
covariances, and then solving the equa- 
tions simultaneously to obtain the theo- 
retical values. These can then be used 
to calculate various nature-nurture ra- 
tios, and the like. 

In principle, the MAVA method is a 
powerful one and should be able to yield 
important information about genetic and 


1 The author wishes to thank R. B. Cattell 
for pointing out several obscurities in an 
earlier version of this paper, and encouraging 
a more systematic critique of the Multiple 
Abstract Variance Analysis model. The 
author would also like to thank K. W. Schaie 
and F. J. Dudek for their comments on the 
present version. 
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environmental influences on traits. It 
contains within it the more traditiona 
twin and foster-sib methods and hence 
provides all the information they on 
plus the additional information derivab 7 
from considering them jointly. How 
ever, the MAVA method in its сштеп 
form appears to suffer from certain a : 
culties, and it is the purpose of t es 
present paper to examine some of ps a 
briefly. Two major kinds of difficulties 
will be considered. The first involves 1" 
ternal inconsistencies in the MAVA ЕП. 
tionsas they are presented by Cattell in hi 
most extensive theoretical treatment © 
the method (Cattell, 1960). The pant] 
is an apparent systematic error in per 
ous applications of the equations to dar, 
by Cattell and his co-workers (Cattell 
al, 1955; Cattell et al, 1957). Alors 
with the examination of these difficulties, 
a new approach to deriving MAVA-ty?” 
equations will be developed, and implice 
tions for previously published MA 
results considered. 


INCONSISTENCIES IN THE MAVA 
EQUATIONS 


system of MAVA equations will in- 
cussed in this section. The first 15, 
consistency between equations ри 
with within- and between-family whe 
ance; the second, inconsistency d in 
handling of selective placement Өй ens 
the equations involving adopted chi 


THEORETICAL NOTES 


Within- and Between-Family Variances 


In order to examine particular dis- 
crepancies in the Cattell equations, it 
will be useful to develop a general form 
of the MAVA equation, from which the 
particular equations for twins, sibs, etc., 
may be derived. (For simplicity, the 
equation will be developed for the usual 
case involving pairs of individuals.) 

Following the general rationale of the 
MAVA method, it is assumed that the 
deviations of the true scores of a pair of 
individuals from the population mean of 


some attribute may be broken into 
additive components, as follows: 
Ху = Xu F Хш TF Xe F ер tn 


Xe = хаһ + Xweg F Voha + сә 


where the x's represent deviation scores; 
wh, we, bh, and be refer, respectively, to 
within-family heredity, within-family en- 
vironment, between-family heredity, and 
between-family environment; and 1 and 2 
designate the two individuals in the pair. 

Now twice? the total variance of the 
trait in the population may be divided 
into two additive portions, с? and oy, 
the former representing variation within 
pairs of individuals, and the latter rep- 


resenting variation between pairs. The 
former can be calculated as: 
1x 
ou = YN - (x1 — хз), [2] 
and the latter as: 
1x 
oe = me (xi Ф ж); [3] 


in each case N represents the number of 
pairs. Substituting in Equation 2 the 
expressions for x; and x» from Equation 
1, we obtain: 

1 N 
OF = У) (хым F Xen К om, t Sa 

2N 4 

— Ханз — Xweg — Voh — Xvg)*; 


2 This is to bring the present development 
into line with that of Cattell, who calculates 
сы? and оъ? as separate estimates of the 
Population variance, with ew? + os? thus cor- 
responding to twice ez? in the general popula- 
tion (Cattell et al., 1955, p. 126). 
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expanding, we have: 
1 à 
ты? = INS (хым? Faot е 


+ 2х. + Eeron t 
+25) ; 


expressing as variances and covariances: 


cv = 3e? F 50а +--+ оьр 
L O N uc, Р тыы 7 
кессе; 


assuming that corresponding 1s апа 25 
are equal in variance and covariance 
(since assignment as 1 or 2 is purely 
arbitrary), and collecting terms: 


2 


Ou =O wh? (1 — тшһшһ») HO we? (1 Расов) 
"Feud — ria) Hoe (1 таре) 
TF 20 who we (Раме Tuohy wea) 
+26 hon (Роны — һә) 
+ 20оь, ("оные — Ране) 
A+ 2o eG bh (reci — oca) 
T 26 w0 be (ава — Tutus) 

+2040 be (bh bey ев): 


[4] 


Now by assuming particular values of 
the correlations in parentheses, this 
general equation may be reduced to any 
one of Cattell's within-family MAVA 
equations. For example, consider sibs 
reared together in the same family (see 
Equation 4 in Table 1, Cattell, 1960) : 


свт? = Gui) + сиё + 2 whweT wh wer 


This equation can be derived from Equa- 
tion 4 above by assuming that: (a) the 
correlations һил ANd Fwerweg аге zero— 
each individual receives an independent 
assortment of the parental genes and 
family influences; (b) the correlations 
Tbhibho and ье, аге unity—the two mem- 
bers come from the same family; (c) 
does not, in general, equal Fwhwez 
below) ; and (d) in all other 
cases, the two correlations within the 
parentheses are equal. Actually this last 
follows automatically from Assumption 2 
— the two r's must be equal in each case 
because one variable is shared and the 
other correlated 1.00. Assumption 3 
brings out a point not obvious in Cattell 5 
version of the equation, that his fwhwe 


Twhywey 
(see comment 
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represents a difference between two cor- 
relations: the tendency for a given hered- 
itary characteristic to select or attract 
certain environmental influences [CA 
and the fact that the genetically influ- 
enced traits of one pair member serve as 
part of the environment to which the 
other is subjected (тикше). Thus cases 
could arise where this term is negligible 
even though both correlations are ap- 
preciable. 

A general equation for between-family 
variance can be derived in the same 
manner as the above, starting from 
Equation 3. This works out to be: 


ei =T (LAP hy whe) FO we? (1 HT weer) 

tov? (1+ ru qui;) cow (1+ riete) 

+26 wh we (тһе T whyweg) 

He eno on rena һы) 

F 26 gw reise, Ре) 

F20 we 0h rois ше) 

1200 recu d ross) 

F2eucw (wies Тыз). [5] 

It will be noted that this is identical to 
Equation 4 for ow", except that the signs 


within parentheses are all positive rather 
than negative. It will also be seen that 


Fu oi = Qo wt? 2042-2002 4-20,2 
Ar chy2e1F wh we +4 ron 15510 whO bh 
TA render оъ +4 veh FeO on 
FHAN webe wO beH Arie mies, 


and that this is equal to 2027. 

Asin the within-family case, we can by 
suitable assumptions about the r’s derive 
Cattell’s various equations. For ex. 
ample, the between-family equation for 
ordinary sibs reared together (see Equa- 
tion 10, Table 1, Cattell, 1960) is: 


Й 
€BN P = Gui? + Ou? + 272 + 20,2 
Freies оное + Arpa Dig, 


To derive this from Equation 5 re- 
quires the same Assumptions 1, 2, and 3, 
as for the within-family case, plus the 
stronger assumption that all the other 7’s 
are zero except љһьг and ье), Which are 
equal, since rj, = 1.00. (Note that if 
the wiwe here and in Cattell's within- 
family equation are the same, rwhyep must 
also be zero.) 
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Proceeding in this fashion to examine 
the rest of Cattell's equations, it Soon 
becomes evident that there are quite à 
few instances in which the assumptions 
required to derive the within-family 
equation are different from, and some- 
times in contradiction with, those ba 
quired for the between-family одага. 
For example, Cattell's equations fo 
identical twins reared together (Equa- 
tions 1 and 8, Table 1, Cattell, 1960) are: 


= Cre” 


erm - 
©втттк? = 20° + Gut + 2шы? + 20v 

Thee 

ка 2*[2v oit uid us 4 ArunneTonTbe 


The first inconsistency involves the 
9? terms, where Trweywep Must be asm 
to be zero to obtain the between-famy 
equation from Equation 5, but where E 
special oy? term in the within-fami Y 
equation allows the twins’ environmen”, 
to be positively correlated—i.e.,. iod 
= Owe(1 — reer)» The second ug i 
sistency involves the within-family = 
redity-environment correlation : to aen 
the within-family equation, one ms 
assume that rov, and ruv; are ваш, 
to derive the between-family equae 
one must assume that they are oy 
(The former assumption is presuma sl 
the correct one, following as it does ye 
lwhwh. = 1.00 and ru, = шмше ® ES 
would lead to the fifth term in the equ 
tion for свгртр? becoming таһыма. 

This last discrepancy in particular, ns 
major consequences for the solutio" 
presented in Cattell's (1960) paper, 45 e 
bears directly on the possibility o A 
parating hereditary and correlated © 
vironmental variance. the 

In his initial presentation of ive 
MAVA method, Cattell did not sol" 
Separately and directly for the effects ce 
heredity within families (the varianc? 
wx?) and correlated environmental йе 
fluences (the Covariance тһе wh we һ oo 
used instead the indirect approac? os- 
Setting rui, to various values and cho 


sg 

ing the value that gave the most ee 
*Such contradictions can be found ethet, 

equations for identical twins reared tof ther 

sibs reared apart, half-sibs reared Children 

half-sibs reared apart, and unrelated С 

reared together. 
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factory results on other grounds—mainly 
in bringing ows? close to съ. The reason 
for the difficulty in the separate solution 
for сш? and ry is clear when one con- 
siders how fwhwe arises. Heredity and 
environment may become correlated 
within families if infants differing in 
genes receive systematically different en- 
vironmental inputs—whether this is due 
to differing reactions of others to genet- 
ically based differences in physique or 
temperament, or due to the infant’s own 
selection of his environment (cf. Cattell, 
1963). How does one separate such 
covariation from more direct effects of 
the genes on the trait in question?  Pre- 
sumably, by finding groups of subjects 
of similar genetic diversity which differ 
systematically in the environmental pres- 
sures brought to bear on the traits, or in 
opportunities for selection of favor- 
able or unfavorable trait environments. 
While such groups might be found, it 
does not appear that the groups the 
present MAVA method uses have been 
selected to vary systematically along 
such dimensions. The difficulty in sepa- 
rating ows? and rune in the present MAVA 
design is therefore understandable. In 
his 1960 paper, however, Cattell pre- 
sents direct solutions for wa? and Fwrw 
which is puzzling, since the data used are 
still the same in relevant respects. An 
examination of the sets of equations sug- 
gests, however, that the solution is an 
artifact resulting from the discrepancy 
noted above. This may be most easily 
seen in Cattell’s Table 2 (Cattell, 1960) 
presenting the 10 equations of his 
"Limited Resources Design." Inspec- 
tion of this table reveals that Equation 1 
contains neither oun? nor ree wives, and 
that in all but one of the other equations 
this variance and covariance stand in a 
constant ratio to each other: a solution 
is thus algebraically possible only because 
of the divergent equation, Number 7— 
and this is the between-family equation 
for identical twins discussed above. If 
the indicated correction of the covariance 
term to 4r ju 18 we is made, this equation 
falls into line with the rest, and separate 
solution for сш? and rus, becomes alge- 
braically impossible. Hence it appears 
that if these variables are to be separated, 
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it must be by incorporating appropriate 
additional groups in the design. 

Alternatively, it might be proposed 
that one estimate the within-family 
values from those between families; but 
this raises the question whether analo- 
gous problems attend the separation of 
heredity and correlated variance in the 
between-family case. The covariance in 
this instance involves two components: 
in part, covariation may result from 
influences existing prior to the birth of 
the subjects in question—systematic re- 
lationships may exist between parental 
genes and family environmental condi- 
tions; and in part covariation may arise, 
as in the within-family case, from en- 
vironmental response to the subject or 
selection of the environment by the 
subject. By the same arguments as in 
the within-family case, it would appear 
that the MAVA design in its present 
form will be unable to separate the 
second kind of heredity-environment co- 
variance from direct hereditary effects; 
the design should, however, be capable 
of estimating correlations due to effects 
of the first kind. 

A final comment may be appropriate 
concerning the relative advantages of 
using within- and between-family vari- 
ance. Ordinarily, if there is a difference, 
the within-family equation will be sim- 
pler than the corresponding between- 
family equation for a given set of as- 
sumptions about 7'ѕ. While Equations 4 
and 5 involve exactly the same variables, 
under typical sets of simplifying assump- 
tions all the terms that vanish from 
Equation 5 also vanish from Equation 4, 
and frequently more, tending to make it 
easier to work with the within-family 
equations. An additional argument for 
using the within-family variances 15 that 
they are probably better behaved in the 
face of sampling inadequacies: if foster 
families, say, come from a restricted 
sector of the population, this will cer- 
tainly make the observed between-family 
variance suspect, but the within-family 
data may still be usable. 


Effects of Selective Placement 


Besides the inconsistencies noted above 
between different equations, there are 
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some cases in the current MAVA model 
where inconsistent assumptions are made 
within equations. An instance is the 
treatment of selective placement effects. 

One of the refinements Cattell intro- 
duced into his 1960 version of MAVA 
was a separate rw," to allow for the 
possibility of heredity and environment 
becoming correlated by selective place- 
ment of infants by adoption agencies. 
However, the effect of such placement on 
the variances is not taken into account. 
Consider the case of sibs reared apart in 
separate families. A placement-agency 
Torbe” is introduced into the between- 
family equation (see Equation 11, Table 
1, Cattell, 1960) to allow for the possi- 
bility that the placement agency may be 
influenced in its placements by knowledge 
concerning the child's biological family. 
Suppose, for example, that the placement 
agency attempts to place each sib in a 
family like the one from which he came. 
To the extent that the agency succeeds, 
this will affect fs», as indicated. But it 
will also affect the environmental vari- 
ance term: the sib pairs will wind up in 
less different families than if they had 
been randomly assigned, and ove should 
be decreased by an appropriate factor 
within pairs and increased between 
pairs. Similar considerations apply in 
the equations for identical twins reared 
apart and unrelated children reared to- 
gether. While in all these cases one may 
argue whether a placement correlation is 
needed, clearly if it is to be included 
placement effects must be allowed for in 
the variances as well. 


CORRECTION ron Ennons 
OF MEASUREMENT 


In his discussions of the MAVA model, 
Cattell has argued persuasively for the 
appropriateness of removing error of 
measurement variance from the observed 
variances prior to entering the theoretical 
equations. In the actual application of 
the MAVA procedure to empirical data, 
however, there appears to have been a 
systematic error. Consider the basic 
measurement operation, measuring a 
trait іп an individual. The measurement 
error involved should be roughly the 
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same whether measuring the trait in an 
identical twin or in a member of an un- 
related pair. And indeed this appears to 
have been the case in the empirical data: 
the reliabilities quoted in Cattell, Ble- 
wett, and Beloff (1955) and in Cattell, 
Kristy, and Stice (1957) fluctuate from 
group to group, as would be expected 
from the small sample sizes, but show no 
Systematic tendency to decrease across 
groups. And yet Cattell's corrections 
do: the error variance removed is 
Systematically less when going from un- 
related individuals to sibs to identical 
twins. This appears to have resulted 
from the use of inappropriate reliabilities 
in formulas of the type (1 — rrr)orr® 2n 
calculating the error variances. Bv this, 
the error variance for a group (in this 
case identical twins) is obtained by multi- 
plying the within-pair variance by the 
complement of the reliability; the reli- 
abilities used, however, were not those of 
the differences from which the within- 
pair variances were calculated, but those 
of the individual test scores. Now the 
reliabilities of differences will not 1m 
general be the same as those of the in- 
dividual Scores, being less as the correla- 
tion between scores increases (Mosier, 
1951, p. 777); hence, use of the original 
score reliabilities in formulas like the 
above will tend to systematically under- 
estimate the error variance for all but 
the general population pairs. This 
underestimation will be progressively 
greater as the pair correlation increases, 
being greatest for the identical twins- 
One can perhaps see this most clearly by 
considering a hypothetical case where the 
"pair" consists of two tests of the same 
individual. Here the “within-pair” vari- 
ance should obviously all be error vari- 
ance—but this result will not follow 1 
the test reliability is substituted in the 
expression above. However, the relia- 
bility of a difference score is zero in this 
case and thus yields a correct result-4 


* Опе need not, in practice, go to all this 
trouble—it can be shown that the use 0 
Mosier’s difference-score reliability formula 17 
Cattell's expression is equivalent to estimating 
the error variance directly from (1 — rt 
where re is the test reliability. The E 
reliability may be obtained either for t 
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The effect of the undercorrection of 
the correlated groups will depend on the 
particular way the observed variances 
enter into the MAVA equations. In 
general, the variance within pairs of 
identical twins will be relatively too 
large (tending to inflate estimates of 
within-family environmental variance), 
and differences between the variances of 
different groups will be too small. Thus 
in the 1955 paper the hereditary variance 
between families (c2), which was ob- 
tained by the subtraction cuz? — 0 s7°, 
was consistently underestimated, leading 
to puzzling difficulties in its relation to 
wr: according to the expectations of the 
authors based on genetic theory (Cattell 
etal., 1955, p. 137), ть? should have been 
equal to or somewhat larger than съв, 
but it persisted in being markedly 
smaller. The proper error correction 
tends to make for a larger ow? and a more 
Satisfactory relationship between the 
two. 

Anattempt was made to systematically 
rework Cattell’s published data using 
appropriate error corrections, but un- 
fortunately the small Ns and low test 
reliabilities resulted in fairly erratic data 
—a number of the corrected variances 


total population or the specific subgroup in 
question, depending on one's theoretical 
Preference (see Cattell et al, 1955, for a 
discussion), 
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were negative—and the effort was aban- 
doned. It is the writer's impression, 
however, that on the whole the effect of 
the corrections was to increase estimates 
of between-family variance relative to 
within-family variance, and of heredity 
relative to environment, compared to 
Cattell’s published estimates. 
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ON THE ASSUMPTION ОЕ А STEEPER AVOIDANCE 


GRADIENT IN MILLER’S CONFLICT THEORY 


RICHARD S. BOGARTZ 


Institute of Child Behavior and Development, University of Iowa 


Miller (1959) derives the prediction that in an approach-avoidance 
situation the animal sometimes will approach the goal, but stop short 
of it. He assumes 2 gradients which cross, and he represents these 
gradients by straight lines. He then asserts that the deductions could 
be made on the basis of any curves that have a continuous negative 
slope that is steeper for avoidance than approach at each point above 
the abscissa. This assertion is correct; however, it is shown here that 
the assumption of an everywhere steeper avoidance gradient rules out 
exponential decay curves as gradients and, in fact, rules out all mono- 
tonically decreasing, differentiable functions which have a limiting 


value of 0 as distance from the goal increases. 


gradients cross appears sufficient. 


Miller (1959) has presented a theo- 
retical treatment of approach-avoidance 
conflict in which he derives the prediction 
that under certain conditions the animal 
will tend to approach the goal when 
started far from it, but will stop ap- 
proaching before the goal is reached. He 
assumes the existence of an approach 
gradient and an avoidance gradient 
which cross, such that near the goal the 
avoidance gradient is higher than the 
approach gradient, but far from the goal 
this relation is reversed. It is assumed 
that the animal approaches the goal if 
the approach gradient is higher than the 
avoidance gradient and avoids it if the 
avoidance gradient is higher than the 
approach gradient. It follows, then, that 
the animal tends “to remain in the region 
where the two gradients intersect [p. 
206]." 

In the graphic representation of this 
derivation (see his Figure 1), Miller uses 
straight-line gradients, but he remarks, 


It is only for the sake of simplicity that the 
gradients are represented by straight lines in 
these diagrams. Similar deductions could be 
made on the basis of any curves that have a 
continuous negative slope that is steeper for 
avoidance than for approach at each point 
above the abscissa. 


This statement is correct, however cer- 
tain functions, e.g., those appearing in 
Figures 10 and 11 of Miller's (1948) paper 


Requiring that the 


on displacement, do not satisfy these 
requirements. F 
It will be shown here that in the situa- 
tion where the gradients are assumed to 
cross, the requirement that the avoidance 
gradient have a steeper slope than the 
approach gradient at each point above 
the abscissa is incompatible with a large 
class of functions. In particular, if it i$ 
assumed (a) that each gradient is a con- 
tinuous, monotonically decreasing func- 
tion of distance from the goal, which is 
differentiable at least once at everY 
distance greater than zero, and (b) that 
the limiting value of each function i$ 
zero, then it cannot be assumed that the 
avoidance gradient is everywhere steeper 
than the approach gradient. " 
Let x be the distance from the goal: 
let F(x) and G(x) be the avoidance an 
approach gradients, respectively. AS 
sume the following. Р 
Assumption 1 (monotonic decreasing 
gradients) : if x» > ху, then F(x) < F(x) 
and б(х) < G(x), | 
Assumption 2 (limiting value of eac? 
gradient is лего): 


lim F(x) = lim G(x) = 0. 


Assumption 3 (steeper avoidance gI2- 
dient at all points): the first derivative? 
of F(x) and G(x), F'(x) and G’(x), exis! 
for all values of x > 0; and, for any *' 
Р(х) < G'(s). 
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Р Since the proof is only for the situation 
in which the gradients cross, the features 
of this situation must also be specified. 

Crossing of gradients condition: there 
exists a point хо such that (а) F(xo) 
=G (xo); (b) ifx < xo, F(x) > G(x); and 
(c) if x > хо, F(x) < G(x). 

Consider two points ху and x» such that 
Xo < xı <x, From elementary calculus 
it is known that 
т 


F(x.) — F(x.) = f 


JT 


OF) dx 
and 


т 


GG) — GG) = Í * G'G) dx 


z1 
Therefore, by Assumption 3, 
т 


G (xs) € G(x) = Í 


z 


| С'(х) dx 


> f Pr) de m Flea) — P: 


1 


Hence, by addition and subtraction, 
F(x) — С(ху) > Е(х:) — GG) 


By Part c of the crossing-of-gradients 
condition, the value on the left-hand side 
of this inequality is a negative number. 

Owever, by Assumption 2, the value on 
the right-hand side (also negative) may 

© brought as close to zero as desired by 
Selecting x. sufficiently large. In par- 
ticular, it can be brought closer to zero 
than is F(x,) — G(x). But this contra- 
dicts the inequality and completes the 
Proof. 
tne XPonential decay curves exemplify 

е problem. Let F(x) = ae-= and 

(x) = beis, where a » b and >17. 


The functions satisfy Assumptions 1 and 
2 and the crossing-of-gradients condition. 
Assumption 3 requires that — iae—* 
= F'(x) < С'(х) = —jbe, or, rear- 
ranging terms, that ia/jb > еб-2=. Ob- 
viously a sufficiently large value of x can 
be chosen such that the inequality is 
false. 

No special brief is held here for either 
the monotonicity assumption or the zero 
limit assumption. The former seems 
psychologically reasonable and consistent 
with Miller's (1959) first two postulates 
(p. 205). The latter also seems reason- 
able in that one might expect that both 
the approach and the avoidance gradients 
could be made as close to zero as desired 
by placing the animal sufficiently far 
from the goal; however, it is not clear to 
what extent this assumption is included 
in Miller's thinking. The point to be 
made here is that the requirement of an 
everywhere steeper avoidance gradient 
appears to be unnecessarily severe. The 
crossing-of-gradients condition seems to 
suffice for the predictions with which 
Miller is concerned. 
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IN DEFENSE OF REMOTE ASSOCIATIONS 


KENT M. DALLETT 


University of California, Los Angeles 


Slamecka (1964) has argued that the evidence for remote associations 
resulting from the use of the association method is of dubious validity, 
owing to a response-bias artifact. Slamecka’s argument fails to be 


convincing: on both logical and ex 


the response-bias artifact does 
association method. 


In order to support his contention that 
the doctrine of remote associations (RAs) 
is of “doubtful validity” Slamecka (1964) 
has tried to show that each of the meth- 
ods commonly used to demonstrate RAs 
does so by means of an artifact. The pur- 
pose of this note is to point out that 
Slamecka’s attack on the association 
method does not, as he supposes, cast 
doubt upon the existence of RAs as 
demonstrated by that method. 

In the association method the subjects 
learn a serial list and are then shown 
items from the list, one by one in 
scrambled order, with instructions to give 
the first other item from the list that 
comes to mind. If the subject gives the 
item which, in learning, followed the item 
presented, his response is classified as an 
adjacent forward association. If he gives 
any other association, his response is 
classed as remote and assigned a degree 
of remoteness depending upon its linear 
distance from the stimulus syllable in the 
original list order. Since the presence 
of responses other than the originally 
"correct" one could be accounted for on 
several grounds, users of the association 
method typically grant no importance to 
the simple fact that such RAs occur and 
focus instead upon the fact that the fre- 
quency of RAs decreases as their degree 
of remoteness increases. This will be 
referred to as the RA gradient, and the 
existence of such a gradient will be taken 
as evidence that the RA doctrine is still, 
in some sense, a useful one. However, 
the raw RA gradient which results from 
simple tabulation of RAs is unacceptable 
evidence, because there are more oppor- 
tunities for RAs of low degrees of re- 


perimental grounds it appears that 
not invalidate the results of the 


moteness to occur than there are oppor- 
tunities for RAs of high degrees of 
remoteness to occur. A “correction for 
Opportunity” has been traditional, in 
which the frequencies of RAs with few 
Opportunities to occur are multiplied by 
an appropriate factor: the best-known ex- 
ample of such a correction is that give? 
by Raskin and Cook (1937), Slamecka 
(1964) claimed that an additional arti- 
fact must be taken into account. Accord- 
ing to Slamecka, subjects may be biased 
to give items from near the ends of the 
list as responses, as a result of the rela- 
tively rapid learning of end items usually 
described as the serial position effect. 
Given the conditions of the association 
method, and a bias to give end items a5 
responses Slamecka (1964) states, 


Such a state of affairs should logically, {Б 
and of itself, produce the pattern of resu 
commonly reported [p. 69]. 


However, Slamecka does not explain the 
logical necessities involved, and in fact ub 
seems more likely that Slamecka’s bias 
should sot “logically, in and of itsel 
produce the RA gradient, The reason for 
this is that end items, but not middle items: 
can enter into RAs of high degrees M 
Temoteness—hence a bias in favor of en! 
items should flatten or invert the s 
A gradient, while a bias favoring ma 
dle items should enhance the gradiene 
Slamecka tried to counterfeit an С 
gradient by giving the subjects the ар 
Propriate response bias in the absence = 
serial learning and supported the argh” 
ment given here: after correction for OP 
portunity he found virtually no for wal, 
gradient, and an inverted gradient 
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TABLE 1 


EXAMPLE or RA Dara, TAKEN FROM SLAMECKA’S (1964) TABLE 2, 
ArrER REMOVAL OF ADJACENT FORWARD ASSOCIATIONS 


Response items 


Stimulus it 
s items г 3 s " = = Total 
(X) = 1 7 + 8 9 39 
: am -— 8 2 9 14 33 
g 24 = e 4 8 11 47 
3 14 9 = = 14 11 48 
4 13 14 10 = — 8 45 
5 26 8 6 5 — — 45 
6 17 13 12 6 6 — 54 
Total 94 55 43 21 45 53 311 


backward RAs. Hence we may flatly re- 
ject Slamecka's (1964) conclusion that 


the overall results of the association method 
can be brought about without any previous 
Serial learning [p. 71]. 


In deciding that his artifactual gradient 
Successfully approximated an RA gradi- 
ent, Slamecka appears to have depended 
upon its similarity in shape to the U- 
shaped gradients reported by Raskin and 
Cook (1937). A closer examination of 
Slamecka's data, however, suggests that 
the similarity is not great enough to be 
convincing, and further suggests that the 
troublesome U shape in the Raskin and 
Cook data has been somewhat over- 
emphasized by the correction for op- 
Portunity usually used in such studies. 

, The major drawback of the conven- 

tional correction for opportunity is that 
re requires multiplying small (and possi- 
bly unreliable) frequencies of infrequent 
RAs by large correction factors. The 
result is that minor differences in RA 
frequencies are likely to become unduly 
Magnified. An excellent example appears 
in the data of Raskin and Cook (1937), 
m which RA frequencies of 12 and 11 
ius не 42 and 77, respectively, after cor- 
aes ue An unwary reader is likely to 
PAS en in by (a) the large difference 
= een the corrected values, or (b) the 
oe themselves; although in ac- 
2 fact the data reveal a difference of 
€ item between frequencies which can- 


not possibly exceed their expected values 
by more than 11 or 12—probably less, 
depending on the nature of the expected 
values. One can design a better correc- 
tion, involving a comparison of RA gradi- 
ents obtained in an experiment, with the 
gradients to be expected if differential 
opportunity alone were operating. In de- 
signing such a correction method, there 
are two further sources of bias which 
must be considered. First, the limited 
opportunities for RAs of high degrees 
of remoteness are further restricted by 
the fact that a subject may give more non- 
remote or "correct" responses to the 
easier end items (which participate in 
RAs of high degrees of remoteness). 
Such a bias is not taken into account by 
the usual correction for opportunity, 
which assumes that every item serves 
equally often as the stimulus for an RA. 
The second bias is along the lines indi- 
cated by Slamecka: if the subjects use 
responses from the end of the list more 
often than they use responses from the 
center of the list, they are more likely to 
give RAs of high degrees of remoteness 
and vice versa. 

For convenience in illustrating the pro- 
posed correction method, the data of 
Slamecka's (1964) Experiment IV will 
be used. In Table 1a matrix of the ob- 
tained RA frequencies appears. The 
rows represent stimulus items, and the 
columns represent the same items used 
as responses. The cell entries are the 
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frequencies with which a given response 
occurred to a given stimulus in the as- 
sociation test. Slamecka placed a sepa- 
rate stimulus item in initial position (X) ; 
in most RA data this does not appear. 
Prior to analysis, it will be noted that 
nonremote (adjacent forward) associ- 
ations have been removed. In some 
analyses (e.g., Dallett, 1959), an associ- 
ation in which the first item in the list is 
given in response to the last item is con- 
sidered an adjacent forward association 
and is also removed, leaving the lower 
left-hand cell vacant. 

Using the appropriate marginal fre- 
quencies, an expected matrix is con- 
structed, incorporating the observed stim- 
ulus bias, response bias, or both. For 
example, the matrix of expected RAs, 
taking only stimulus bias into account, 
would have entries of 7.80 in each of the 
nonvacant cells of the first TOW, under 
the assumption that of the 39 RAs elicited 
by Stimulus (X), Responses 2, 3, 4, 5, 
and 6 аге equally likely, and therefore 
each cell entry is 39 + 5, The entries in 
each nonvacant cell of the second row 
would be 33--4 ог 825. Referring to 
Table 1, it will be seen that the obtained 
frequency for forward RAs of degree 1 
is 11--8-4- 4-- 14 4-8 — 45, The ex- 
pected frequency, on the basis of stimulus 
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Fic. 1. Expected (circles) and obtained 
(triangles) RA gradients from Slamecka's 
Experiment IV. (The crosses indicate the 
effect of an additional correction for response 
bias applied to the expected values.) 
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Fic. 2. A comparison of expected (“рге- 
dicted") and obtained RA gradients for the 
Subjects tested at different intervals after 
learning (Dallett, 1959). 


bias and opportunity alone, is 7.80 + 825 
+ 1125 + 12.00 + 11.25 = 51.05, It will 
be noted that the correction for oppor- 
tunity arises from the fact that both ex- 
pected and obtained matrices contain the 
same disproportionate opportunities for 
RAs of low degrees of remoteness to OC- 
cur. For forward RAs of Degree 5 
there is only one opportunity, with an 
obtained frequency of 9 and a predicted 
frequency of 7.80. 


igure 1 presents the obtained and er 
pected RA gradients for Slamecka’s data. 
The expected gradients are represented in 


+The conventions for numbering degrees 
of remoteness need clarification, In the dis- 
Cussion of Slamecka’s data, adjacent forward 
associations are labeled "degree 0” in agree- 
pent with Slamecka's usage. In Figure 1, 
however, adjacent backward associations aP- 
Far as "degree —1"; Slamecka calls these 
degree 0" also, In Figure 2, the convention 
adopted was to call the stimulus “degree 0”; 
hence, the first backward degree of remote- 
ness is “degree —1 and adjacent forwar! 
js Hom (not plotted) are of “degree 
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circles with solid lines, while the obtained 
gradients appear as triangles and dashed 
lines. As previously indicated, Sla- 
mecka's obtained RA gradient appears to 
be flatter than one would expect on the 
basis of opportunity and stimulus bias 
alone. It should be noted that the mean 
difference between expected and obtained 
gradients must be zero, so that valid 
interpretations are limited to the dif- 
ferent shapes of the gradients, and differ- 
ent meanings cannot be given to devi- 
ations below and deviations above the 
expected gradient. 

An analysis incorporating these stimu- 
lus-bias and opportunity corrections was 
carried out in an earlier study (Dallett, 
1959), using actual data obtained by the 
association method. Expected (circles) 
and obtained (triangles) RA data from 
that study appear in Figure 2. The asso- 
ciation test was given immediately, 20 
minutes, or 48 hours, after the subjects 
learned a list of 14 CVCs to a 79% 
criterion of mastery. Inspection of Fig- 
ure 2 reveals that the obtained RA gradi- 
ent is in general steeper than the expected 
one. A correction for response bias was 
not attempted, but would probably have 
very little effect, the frequencies of usage 
of Responses 1-14 being, respectively, 
40, 35, 39, 49, 31, 25, 30, 45, 34, 37, 31, 
50, 49, 44. A serial position response 
bias 1s not apparent in these data. 

It is possible to incorporate an approxi- 
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mate correction for response bias as well. 
The expected matrix for Slamecka’s data, 
with both stimulus and response bias in- 
corporated, appears in Table 2. To il- 
lustrate the computation of cell entries, 
the first entry in the first row is (55 X 
39) — 217; in other words, the row total 
is distributed among cells in such a way 
that cell entries are proportional to the 
relevant column totals. The reason for 
this being an approximate correction is 
that each row total is associated with 
different columns, and it will be noted 
that the column totals are not recovered 
exactly, as they would be, say, in a con- 
tingency table for which complete inde- 
pendence of rows and columns was a 
tenable hypothesis. Despite the approxi- 
mate nature of the correction, it will be 
noted from the crosses in Figure 1, that 
the expected RA gradient, taking response 
bias into account, is an improved predic- 
tor for 9/10 data points, as would be 
expected if we attribute the flatness of 
Slamecka's RA gradient to response bias 
effects. 

Slamecka's data and the data of Dallett 
(1959), when considered together, indi- 
cate that the gradient of RAs which the 
association method reveals is not entirely 
artifactual, although the methods of 
analysis usually used have ignored some 
artifacts or possible artifacts associated 
with serial position effects. A subject 
does not respond randomly when en- 


TABLE 2 


MATRIX or Expecrep RA FREQUENCIES, BASED UPON THE MARGINAL TOTALS OF 
TABLE 1: Bora Strmutus Bras AND RESPONSE BIAS ARE TAKEN INTO ACCOUNT 


Stimulus items Response items Total 
1 2 x 4 5 6 
= inb 
(X) — 9.88 7.73 3.77 8.09 9.52 38. 
T E = 8.7 4.28 9.17 10.80 33.00 
2 20.74 — — 4.63 9.93 11.69 46.99 
3 18.26 10.68 = = 8.74 10.30 47,98 
4 17.26 10.10 7.89 — — 9.73 Ric 
5 19.86 11.61 9.08 4.44 = = 44, 
6 19.67 11.51 9.00 4.39 9.42 — 53.99 
Total 95.79 53.78 42.45 21.51 45.88 52.04 310.92 
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couraged to respond freely. Whether 
this is a basis or byproduct of serial 
learning and the serial position effect is 
still an open question. 
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IN DEFENSE OF REMOTE ASSOCIATIONS 


B. R. BUGELSKI 
State University of New York at Buffalo 


Recently Slamecka criticized the concept of remote associations and 
attacked 3 kinds of evidence for the concept. His attacks are analyzed 
and questioned as to relevance to the issue. А 1-trial learning experi- 
ment is reported wherein familiar items are shown to follow the 
traditional pattern of distribution of remote associations. Ехрегі- 


mental results refute deductions from Slamecka’s arguments. 


It is 


concluded that not only are direct verbal associations formed in 
serial learning (contrary to Slamecka’s claim) but that remote associ- 
ations are obviously formed in early stages of learning. 


The logical and experimental attack by 
Slamecka (1964) on the concept of re- 
mote associations (RAs) is a remarkable 
tour de force. His analysis of three lines 
of evidence for such associations led him 
to the conclusion that not only are RAs 
nonexistent, but that in the learning of 
serial lists even direct associations are 
not formed, ie, the items in the list do 
not serve as stimuli for succeeding re- 
sponses but that, instead, responses are 
Somehow learned to “self-generated se- 
quential or spatial" symbols such as 

first," “second,” etc. Should such con- 

clusions actually turn out to be valid, a 
great deal of traditional work on serial 
learning would have to be reevaluated, 
and much theoretical speculation would 
have been generated for naught. In view 
of the enormous amount of such literature 
and speculation, Slamecka’s arguments 
Should not be allowed to stand without a 
challenge. 

It is my contention that Slamecka has 
concocted an ingenious and plausible, but 
jo ocn account of the process of serial 
‘earning, Much of what he has to say 
1s indisputable, and, granting his assump- 
tions, much of his argument appears un- 
assailable. Closer examination, however, 
Suggests that the attack is directed at a 
obe of straw men, and that the tradi- 
т views Slamecka hopes to deny are 
ardly affected. It is, however, proper 

point out that the fresh approach taken 
T Slamecka has produced a number of 
See ag in our understanding of serial 

rning. These will be duly noted. 


In this commentary on Slamecka’s 
analysis I shall first point out some 
general considerations and then comment 
on his specific arguments and supporting 
experimental evidence. 


GENERAL CONSIDERATIONS 


Slamecka argues that traditional psy- 
chology has made a mystery of the com- 
mon finding that different parts of a list 
are learned with different efficiency, with 
the point just beyond the middle of a list 
being the most difficult to master. Theo- 
rists like Hull (1935) tried to explain 
such findings in terms of postulate sys- 
tems dnd used data from experimental 
tests of derived theorems as support for 
the postulates. He regarded the differ- 
ential difficulty demonstrated in serial list 
learning as a real and meaningful prob- 
lem, a problem that called for solution 
along theoretical lines. Slamecka, in his 
approach, completely reverses the situ- 
ation. He assumes the differential dif- 
ficulty as a given that requires no explan- 
ation and proceeds to use this empirically 
established difficulty as the basis for his 
explanation of what goes on in serial 
learning. This appears to bea clear case 
of begging the question. Slamecka s raw 
empiricism solves the theoretical problem 
The nature of association 


by ignoring it. 5 
H reduced to a lawless cataloging oper- 
ation. 


ation offered by Slamecka 
tatement that when a 
a serial list he really 
al position of each item 


The explan 
amounts to a § 
learner masters 
learns the ordin 
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as a kind of geographical locus or point 
in some kind of space. Each item is 
learned without regard to any possible 
associations it might have with preceeding 
items or with those that follow. It would 
be fair to ask Slamecka to reply to such 
questions as: “What is the seventeenth 
letter of the alphabet?” or “Who was the 
twenty-first president?” without any at- 
tempt at such operations as reciting prior 
letters or taking advantage of any other 
known common associations like “Lincoln 
was 16; Johnson came after Lincoln, so 
he was 17; then came Grant, 18; then 
Hayes, 19; Hayes was followed by Gar- 
field, 20, and, ah, yes, Garfield, Arthur, 
21st.” 

The proposition that we learn serial 
lists of nonsense syllables through some- 
how noting their order without associ- 
ating the syllables with each other is cer- 
tainly challenging even though not readily 
acceptable. There is no question that the 
first and last Syllables are learned rela- 
tively quickly and that they are recog- 
nized and probably labeled as "first" and 
"last" very early in the process. But to 
think that such essentially superfluous in- 
formation is the essence of learning serial 
lists is to ignore the f. 


s act that in many 
serial tasks, for example, in the playing 


eciting poetry, 
gets commonly 
point and starts 


When a learner does master a relatively 
short list, e.g., 10 items, he may very 
well be able to tell which syllable occupies 
what position, but not because he learned 
them according to the numbers. If we 
ask such a learner for the eighth syllable, 
he can quickly rehearse the list back- 
wards, ie, he "knows" the last syllable 
well and can probably generate its stimu- 
lus. That may enable him to think of 
the previous item, Similarly he can tell 
us the second and the third. With longer 
lists we could expect the process to break 
down just as we can expect our college- 
educated colleagues not to know the six- 
teenth letter of the alphabet or the 
twentieth unless they start from some 
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fixed point and count while associating 
one letter with its predecessor. 


SpPEcIFIC ARGUMENTS 


Slamecka attacks the principle of re- 
mote association on three fronts, those 
of “revised lists,” association tests, and 
intrusion errors, Each of these has been 
used in the tradition to support or justify 
the concept of remote association. How- 
ever, even confirmed believers in such as- 
sociations have admitted that the first two 
Sources of evidence are not strong and 
merely support the concept. Slamecka, 
though recognizing such admissions, 
mounts his strongest attack on the first 
two lines of evidence. I shall examine 
each argument in order, 


Method of Derived Lists 


Slamecka argues that studies using p. 
rived" lists do not provide evidence fo 
RAs; that, instead, they merely prove 
that a subject could instruct himself 10 
take advantage of known facts about ШЕ 
construction of the list, e.g., a subje 
learning a first-degree remote derived lis 
might simply say to himself: “Every ae 
ond one." Ѕјатеска goes on to ps 
that learning derived lists is no mor 
efficient than learning control lists. еш, 
ally these arguments are beside the Вон. 
as the evidence, since Ebbinghaus, Bas 
Shown that derived lists may inhibit per 
formance as well as facilitate it. (Б 
is no way of telling whether a dene 
list will be easy or difficult to learn Se 
any given instance, All that Ebbinghau 
claimed was that a first-order derive 
list would be learned better than a secon A 
or third-order list, Slamecka reports de 
experiment in which he compares 2 ^ 
rived list with a control list and finds 1 
differences, Such 4 finding is simply 
irrelevant, ed 

In designing his study Slamecka ues 
What he calls à "modified" or "avera£ s 
derived list which consists of three kin s 
of items—some are direct association 
from the original list, some are one re 
gree remote, others are two degrees ub- 
mote. This he argues prevents the SUA 
ject from Using some easy form 
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approach such as “skip every other one.” 
Certainly such mediational devices should 
be controlled, but to confuse the subject 
with an arrangement where three dif- 
ferent kinds of mediators might well oc- 
cur to the subject is certainly not the 
Proper procedure for getting rid of 
mediators. To make matters worse, 
Slamecka uses letters of the alphabet 
which already have strong direct associ- 
ations, which confuse the issue further. 


Association Method 


, Here Slamecka argues that serial posi- 
tion effects are the real explanation of 
so-called RAs. Syllables are learned best 
at both ends of a list and therefore have 
a higher availability if called for in as- 
Sociation tests. How such an argument 
explains RAs is difficult to understand as 
Subjects should not know the middle items 
Well and would be unable to respond with 
them to early items. Only the first few 
items should have any RAs at all, and 
these should be from the other end of the 
list instead of from the middle. In prac- 
tice, most RAs come from the middle 
Of a list, 
m To test his argument, Slamecka de- 
Signed a procedure of random exposures 
of Six nonsense syllables at different fre- 
quencies and then tested for associations. 
Я 15 procedure called for 25 exposures of 
ne syllable, 10 exposures of two others, 
uate” more, and only 1 of the sixth 
ane le. To take this arrangement of 
mo 1o MESA as an analogue of 
Her learning, we would have to assume 
n Subjects normally take 25 trials to 
syl vw six-syllable list and that the first 
tur на is learned 25 times better (faster) 
a n the middle one. Such assumptions 
Tain credibility, but apparently are re- 
a to produce the data Slamecka then 
а demonstrate that serial position ef- 
h S account for RAs and not vice versa. 
alin, procedure also results in a finding of 
aim. as many direct backward associ- 
Me S as forward associations. (45-40) 
fa backward remote associations of 
ward second, and third degree than for- 
ir associations—a result yet to be re- 
Tted in any learning literature. In 
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fact, Slamecka finds more backward than 
forward associations if we compare the 
four degrees of remoteness possible for 
backward associations with the first four 
degrees of forward remoteness. This 
fantastic result belies any comparability 
of such artificial procedures and normal 
learning. The argument from this anal- 
ogy appears to be strikingly irrelevant 
to the issue. It should be noted that in 
this attack Slamecka appears to be argu- 
ing that RAs are given by the subjects 
because the responses are known and 
available, i.e., RAs are a function of avail- 
ability. In the next section we will note 
that Slamecka argues that RAs occur be- 
cause the responses are not known—the 
responses that are known and identified 
by position are inhibited by the subjects 
because they are known to be incorrect. 
The argument appears heavily ad hoc. 


Method of Anticipatory and Perseverative 
Errors 

Under this heading Slamecka offers no 
new experiment; what he does instead is 
to report data from a previous experiment 
which demonstrate the traditional findings 
of RAs, Slamecka explains these RAs, 
however, as matters of inadequate identifi- 
cation of the position of different re- 
sponses. To learn a list, runs the argu- 
ment, subjects must learn not only the 
item itself, that is, become familiar with 
the item per se, but must also learn its 
position. Until an item is learned (i.e.. 
becomes familiar) it cannot be ventured 
as an association, remote or otherwise. 
As learning proceeds with the first and 
last items becoming identified and elimi- 
nated as potential intruders, the field 


om which RAs can be drawn is nar- 
= ecomes identified 


rowed. As each item becom 

with a certain position, it is no longer 
ventured as an This explains why 
the middle of the list provides most of 
these incorrect, “remote,” associations or 
“intrusions.” While the items are still 


unlearned, states Slamecka (1964), 


At this early stage he is more likely to emit 
remote association of а higher degree than 
later, since his ignorance of the list is maxi- 
mal and his guesses are less constrained. 
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From this remark I gather that the sub- 
ject is unaffected by any serial position 
effects and that his guessing is just that 
the subject is just as likely to respond 
with any one item as any other. Of 
course, with succeeding trials the num- 
ber of intrusions decreases, and eventually 
there are none. The traditional pattern 
of RAs is explained away by Slamecka 
as a simple artifact or by-product of the 
assumption that when the subjects learn 
a list they learn it from both ends at first 
and slowly close in on the middle items. 
As the middle range is narrowed the de- 
gree of remoteness available decreases: 
this automatically results in more first- 
order RAs, fewer second order, still 
fewer third order, and so on. While 
there is some merit to this argument, it 
is not at all a fact that a list is learned in 
the symmetrical way Slamecka requires, 
Commonly, there is a clear advantage to 
the early items over the late ones. In 
any event, on the very first trial, there 
can be no such cumulative squeeze play 
as Slamecka describes. If RAs are 
formed on such a first trial, and if they 
are from an unconstrained range, they 
should not follow any particular pattern 
of distribution; but, if Slamecka is right, 
they should occur more or less randomly. 

The Slamecka argument suggests two 
testable deductions: (a) if the subjects 
are asked to learn a list of already fa- 
miliar items, say common words, they 
will not waste time on a familiarization 
stage—they can become reasonably fa- 
miliar with the items in one trial. All of 
the words in a short list should then be 
more or less available as intrusions. If 
only one trial is given, there can be no 
piling up of short-range RAs, Further, 
if one word, e.g., the third word of the 
list, is then presented to the subjects 
with a request that they name the word 
that followed it, they should be free to 
respond with any of the words of the 
list except the key word itself. If they 
respond randomly, then Slamecka can be 
credited with a correct analysis. If, on 
the other hand, the pattern of responses 
follows the traditional curve (see Bu- 
gelski, 1950), then his argument must be 
rejected. (b) Because the subjects will 
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undoubtedly learn some of the words and 
be able to report them correctly, i.e., in 
their proper locus in the list, they should 
not use such known responses as RAs. 
If a given subject, for example, knows 
the tenth word to be the tenth and not 
the fourth, he should not use it as a re- 
sponse to the third word. In short, the 
Slamecka analysis calls for a negative 
correlation between the frequency of cor- 
rect responses (correct position location) 
and the frequency with which such words 
are reported as RAs. 


EXPERIMENTAL REBUTTAL 


To test these deductions the author se- 
lected a list of 10 common 4-letter ad- 
jectives from a collection prepared by 
Castaneda, Fahel, and Odom (1961). 
None of the 10 adjectives selected was an 
associate of any of the others as tested 
by Castaneda, Fahel, and Odom; but as 
an additional precaution, three arrange- 
ments of the 10 words were prepared to 
counteract possible tendencies of the sub- 
jects to form meaningful associations be- 
tween the words because of some potential 
"belongingness," 


Procedure 


Three classes of introductory psychology 
Students were used as subjects. In eac 
class, the author, serving as experimenter 
told the subjects that they were to participate 
in a typical serial learning experiment, tha 
they would be shown a list of 10 common 
English words at 3 seconds per word, ап 
that they were to learn the words in serial 
order. Nothing was said about the number 
of trials, and it can be presumed that most 
of the subjects expected several ша 
The words were printed in 4-inch bloc 
letters and could be seen easily by all sub- 
jects. Each stimulus card was exposed 
3 seconds. As soon as the last card awe? 
shown, the experimenter asked the subjects 
to write down the word that follow" 
н та each class а differen 
word order was used, and in each case к 
third word in the list was pronounced as th 
cue or stimulus word to be responded #0: 
Nothing was said about its position. he 
subjects were instructed to write down nite 
correct word if possible and if not to pre 
down the most likely word they could thin 
of from the list just shown. 
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TABLE 1 


RAs To THE THIRD WORD IN A 
10-Worp SERIAL LIST 


Word in Degree of Number of 

list remoteness responses 
1 —2 2 
2 —1 19 
3 — — 

1 0 -8 

5 1 77 
0 2 28 
7 3 19 
8 4 16 
9 5 5 
10 6 11 
Total 177 


Ж The fourth word would be a direct association. It 
Was reported correctly by 178 of the 355 subjects. 


As soon as the subjects had written their 
Tesponses, the experimenter instructed the 
Subjects to write down the 10 words in 
order and to leave blanks where they could 
Not recall the proper word. 


Results 


The total number of subjects was 355, 
and the total number of responses for the 
first test response was also 355. Of these 
178 were correct, and the remaining 177 
Were incorrect. As seen in Table 1 there 
Were 21 backward associations and 156 
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forward associations distributed accord- 
ing to the traditional pattern of mostly 
first-order remoteness, fewer second, less 
of third, and so on. Six degrees of re- 
moteness were possible, and all were 
represented. The presence of such a pat- 
tern after the first trial should adequately 
rebut Slamecka's expectation that the re- 
sponses should be randomly distributed. 
The distribution is significantly different 
from a random assortment of responses 
(x? = 2149, df = 5, p = .005). 

To answer the proposition that the 
subjects in such a situation refrain from 
using items whose position has been 
learned, a Pearson r was calculated be- 
tween the number of responses correctly 
identified in the second test (where the 
subjects tried to report the whole list) 
and the frequency with which such re- 
sponses appeared as RAs.  Slamecka 
would have to assume a negative r. The 
actual r was positive, .593, and sig- 
nificant at the .05 level of confidence. 
It might be noted that the last word, al- 
though learned better than the ninth word 
(see Table 2), appeared as an RA 
twice as often. 

The results just reported indicate that 
RAs are formed even on the first trial 
with familiar material. There can be no 
question of shrinking ranges forcing out 
some artifactual array of RAs. The data 


TABLE 2 


CORRECT AND ERRONEOUS RESPONSES IN A RECALL TEST 
AFTER ONE LEARNING TRIAL 


Correct Incorrect 
Word response, response, Not 
Positi ү wron; D 
list S OET ko d E га 

5 2 
1 348 0 0 5 2 
2 267 4 2 40 + 
3 241 27 10 94 : : 
4 164 14 26 82 З 

5 96 25 19 138 10 
6 63 19 25 113 154 
7 46 24 24 98 187 
8 38 27 17 113 187 
9 45 30 15 11 218 
10 76 3 3 78 198 
Total 1384 223 141 838 1187 


Note,—N = 355, 
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gathered also suggest that one should be 
careful about postulating the order of 
learning in terms of some pattern of be- 
ginning and end first, then the middle. 
It is true that there is some small ad- 
vantage of the very end of the list over the 
ninth and eighth words, but the striking 
feature of the data is the preponderant 
mastery of the early part of the list. Such 
mastery of the first two words could ac- 
count for their infrequent appearance as 
backward RAs, thus supporting Sla- 
mecka's position to some degree. 
Certain interesting data were avail- 
able from the records of the second test 
where the subjects tried to recall the en- 
tire list. These data, shown in Table 2, 
summarize what the subjects can learn 
in one trial and how various responses 
are distributed, Remembering that these 
data are for a total recall operation and 
no longer a simple test of the response to 
the third word, we can note the subjects 
learned almost half of the task in the one 
trial given. Most of the correct responses 
were made up of the first four items. On 
223 occasions a word (see Column 3) was 
reported correctly even though the stim- 
ulus word was missing or incorrect. 
Slamecka might argue that this shows 
that learning serial material is learning 
by the numbers. While this interpreta- 
tion cannot be denied, it is possible to 
explain the occurrence of any word at any 
correct position as a confluence of RA 
effects. In the fourth column are listed 
those cases in which a correct response 
was made to a proper stimulus although 
neither was in the correct position. These 
data might be taken to support the as- 
sociation doctrine Slamecka denied. The 
fifth column shows that each word in the 
list was reported in one place or another 
incorrectly by at least some subjects, Tt 
is surprising to note how frequently the 
last item was reported as other than last. 
This last item apparently was familiar 
enough to be mentioned, but had not 
been identified as “last.” In the last 
column of Table 2 are listed the failures 
to use the word in any form at all. The 
prevailing trend shown is an inability to 
come up with the later words in the list. 


THEORETICAL NOTES 


GENERAL CONCLUSIONS 


The experiment reported above appears 
to answer Slamecka's criticisms of the 
traditional doctrine of remote associa- 
tions on every count. By using meaning- 
ful (familiar) words, it is possible to take 
advantage of first trial responses which 
are uncontaminated by the effects of re- 
peated trials. The argument advanced by 
Slamecka relies heavily on cumulative 
trial effects as well as rapid learning of 
the end of the list. Neither of these fac- 
tors can be responsible for the present 
results, 

The review of Slamecka's arguments 
showed them to be less directly pertinent 
than a first reading might suggest. The 
experiment reported here throws mur 
tional doubt on Slamecka's hypothetica 
position curve and makes his arguments 
from repeated trials irrelevant. . а 
simple demonstration of the traditiona 
curve of RAs developed after one run 
through a serial list should keep RAs i" 
the category of useful psychological con- 
structs for a while longer. The doctrine 
of direct association of one word wit 
another which Slamecka also challenged 
hardly needed support, but the data re- 
ported in the present study reveale’ 
enough instances of word-to-word associ- 
ation regardless of serial position tO 
justify the confidence of former research- 


+ . HN MH 5 
ers in the reality of such association 
also, 
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psychiatrist after reading 


TOWARD AN 
UNDERSTANDING 
OF 
HOMOSEXUALITY 


by Dr. Daniel Cappon, 
M.B., F.R.C.P. (Edin.) 


Dr. Cappon, Professor of Psy- 
chiatry and chief of the Psy- 
chodynamics and Psychopathology 
Section at the University of 
Toronto, has a large private prac- 
tice. In this landmark book Dr. 
Cappon contends that homosexu- 
ality is a symptom of faulty 
development and adaptation, that 
its causes are reversible and conse- 
quently it can be cured by psycho- 
therapy. As a physician in psycho- 
logical practice he has diagnosed 
and treated some 200 homosexual 
patients. 


Dr. Cappon corrects many common 
misconceptions and provides deep 
insights into the moral, historical, 
sociological and religious as well as 
clinical aspects of homosexuality. 
This honest, realistic and optimistic 
new book offers counsel to doctors, 
clergymen and others who are 
called upon to help people with 
homosexual problems. Bibliog- 
raphy, $6.95 


at your bookstore 
or Dept. 301 


PRENTICE-HALL, INC. 


Englewood Cliffs, IN. 4. 
p náá—— n 


THE BRITISH JOURNAL 
OF PSYCHOLOGY 


Edited by Boris SEMEONOFF 


Vol. 55 Part 4 November, 1964 25s. net 


OBITUARY NOTICE: 

CHARLES WILFRED VALEN TINE. 
MICHAEL ARGYLE. Introjection: A form of Social learning, 
P. M. A. RABBITT. Ignoring irrelevant information. 


G.S. TUNE. Sequential errors ina time-sharing task. 


. and J. H. McFARLAND. Relation be- 
tween perceptual and conceptual operations 
t. 


: Numerical distance and visual 

exten 

R. CONRAD and A. J. HULL. Information, acoustic confusion and memory 
span. 


M. HAMMERTON and A. N. TICKNER, 


Transfer of training between space- 
oriented and body-oriented control situations. 


T. X. BARBER and D. S. CALVERLEY. Experimental Studies in ‘hypnotic’ 
behaviour: Suggested deafness evaluated by delayed auditory feedback. 


- Validation of the Maudsley personality inventory 
E scale. 
Е. I. M. CRAIK. An observed age difference in Tesponses to a personality 
inventory. 


R. N. HUGHES. Responses by the ferret to stimulus change. 
R. W. PICKFORD, A deuteranomalous artist. 

ANN D. M. DAVIES. Season of birth 
CYRIL BURT. Baudouin on Jung. 
PUBLICATIONS RECENTLY RECEIVED. 
OTHER PUBLICATIONS RECEIVED. 


э intelligence and Personality measures. 


—— 


The Subseription price per volume, payable in advance, 
is 755. net (post free). 


_ 


Subscriptions may be sent to any bookseller or to the 


CAMBRIDGE UNIVERSITY PRESS 
Bentley House, Euston Road, London, N. W. 1 


ee 


ATTRIBUTE- AND RULE-LEARNING ASPECTS OF 


PSYCHOLOGICAL REVIEW 


CONCEPTUAL BEHAVIOR' 


LYLE E. BOURNE, JR. 


University of Colorado 


ROBERT C. HAYGOOD 2 
University of Utah 


AND 


Analysis of the structure of concepts discloses 2 major features: (a) 
the relevant attributes, and (b) the conceptual rule by which the 
attributes are combined to form the concept. 2 experiments were 
performed to separate experimentally attribute identification and rule 
learning under a variety of 2-dimensional conceptual rules such as 
conjunction and inclusive disjunction. The results indicated that the 
rules differ in difficulty initially regardless of whether or not the 
relevant attributes are known, but that the differences decrease across 
successive problems. Further, it was found that knowledge of the 
rule represents valuable information to S which improves perform- 
ance significantly. The findings were interpreted as demonstrating 
the validity of the analysis of conceptual behavior into attribute- and 
rule-learning components, and suggestions for future research to 
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explore rule learning were formulated. 


Problems used in experimental 
studies of conceptual behavior have 
two major features, either or both of 
which may be initially unknown to 
the subject. First, there are the stim- 
ulus characteristics which make up the 
Specific concept-to-be-learned. Con- 
ventionally, these are called the rele- 
vant attributes; in commonly used 
geometric designs, they are values on 
| саш stimulus dimensions. Second, 
А еге is the general form or type of 
Oncept, represented by a rule which 
combines or otherwise elaborates the 
relevant attributes to define the con- 
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cept. For example, a bidimensional 
principle for classification might spe- 
cify that "all red square figures are 
examples of the concept." In such a 
concept color and form are the rele- 
vant dimensions, while red and square 
are the relevant attributes or values of 
the dimensions; the conceptual rule is 
conjunction or joint presence of at- 
tributes. 


Attribute Identification 

In most concept-learning studies, 
particularly the concept identification 
experiments (e.g. Archer, Bourne, & 
Brown, 1955; Bourne, 1957), interest 
has centered primarily on the discov- 
ery or identification of relevant attri- 
butes. Typically the general form of 
solution is described and illustrated for 
the subject with preliminary instruc- 
tions and practice problems and, as 
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such, constitutes a “given condition.” 
Indeed, the majority of these experi- 
ments have employed simple and famil- 
iar unidimensional or conjunctive con- 
cepts or solutions. Once the unknown 
relevant attribute(s) is (are) discov- 
ered, through an inductive process based 
on the subject's observations of a series 
of examples and nonexamples of the 
concept, the problem is to all intents 
and purposes solved. 

Correspondingly, concept-identifica- 
tion studies have explored variables 
which appear to be important to the 
identification of relevant attributes, 
such as number of relevant and irrele- 
vant stimulus dimensions (Walker & 
Bourne, 1961), amount of intra- and 
interdimensional variability (Battig & 
Bourne, 1961), and redundancy be- 
tween dimensions (Bourne & Hay- 
good, 1959; Haygood & Bourne, 
1964). "Theoretical interpretations of 
concept identification have been based 
largely upon processes involving stimu- 
lus features of a single problem, such 
as the conditioning and adaptation of 
relevant and irrelevant cues (Bourne 
& Restle, 1959) and the sampling and 
testing of hypotheses about these fea- 
tures (Restle, 1962; Trabasso & 
Bower, 1964). 


Conceptual Rules 


Aside from incidental comparisons 
of the difficulty of unidimensional 
and conjunctive concept identification 
(Bourne & Haygood, 1959; Walker 
& Bourne, 1961), there has been no 
concern in this series of studies with 
type of concept or conceptual rule as 
a variable. The tacit assumption has 
been that identifying relevant attri- 
butes will be affected in much the 
same way by important variables re- 
gardless of the rule. Indeed, prelimi- 
nary evidence, showing that the effects 
of number of relevant and irrelevant 
dimensions are the same for concepts 
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based on conjunctive and bicondi- 
tional rules, has been reported (Ke- 
pros & Bourne, 1963). There is, 
however, some recent evidence (e.g. 
Conant & Trabasso, 1964; Hunt & 
Kreuter, 1962; Neisser & Weene, 
1962 ; Shepard, Hovland, & Jenkins, 
1961) to indicate that, under certain 
circumstances, types of concepts differ 
considerably in their difficulty. For 
this reason, it seems important to con- 
sider in detail the role that conceptual 
tules may play in concept learning. 
Although there are many rules for 
combining stimulus attributes and gen- 
erating concepts, this paper will be 
limited to a system of constructing 
nominal, or discrete, concepts with, at 
most, two relevant attributes. 

Consider first a closed stimulus 
population (Hovland, 1952) gener- 
ated by x independent dimensions 0 
У values each, consisting of the y” pos- 
sible combinations of stimulus attri- 
butes. When both x and у are two 
or more, it is possible to select tw 
values from separate dimensions to Ре 
focal attributes. This selection maps 
the entire stimulus population onto 
the four categories or contingencies 
defined by the presence and absence 
of the focal attributes. For example, 
if redness (R) and squareness (S) 
are chosen as focal attributes, the four 
contingencies so defined are RS, R^ 
RS, and RS.: 

When these two focal attributes are 
further selected to be relevant attri 
butes in defining a concept, the four 
contingencies are mapped onto a tw" 
response system: examples and non- 


ў 

* The bar over a symbol stands for “not, 
hence the Categories read "red square’ 
"red, not Square"; etc. In logical terminos 
ogy the symbols T (true) and Е (also 
are used, representing the presence and ee 
sence, respectively, of the attribute ар пе 
fied. Thus in general terms, the cont! 
gencies are TT, TF, FT, and FF. 
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TABLE 1 


Fourteen MAPPINGS or A STIMULUS POPULATION with Two FOCAL ATTRIBUTES 
ONTO A BINARY RESPONSE SYSTEM 


Dimensions | 
and levels I 
Pattern sets | Partitions 
Color | Form | 
Е =" | EV | | 
R Contin- | Two | Th | = | | | 
S gencies | level level Бү el (bal ci LIE K|L|M|N 
T ed RS |RS| RS -x|-|«|-I-|*|*|-|i-.-|*|-|-|- 
TUE | RE ув нё wp ES Oe үш ту жү E 
AERE араа Ea ad Rl od od d od a d d | d d 
P F RS | GT |GT.GC. 
вт.вс|-|+|+|+|-|+|-|+|+|-|-|-|-|+ 
Note.—The following abbreviations are used: S—square, T—triangle, C—circle, R—red, G—green, and 


B—blue, 


examples of the concept (positive and 
negative response categories, respec- 
tively). "This second mapping creates 
16 binary partitions of the stimulus 
population, two of which are trivial 
мр they place the entire popula- 
Sm in either the positive or negative 

ponse category. The remaining 14 
nontrivial partitions are defined by the 
Unique distribution of the four attri- 
bute contingencies (RS, RS, RS, and 
“ae into either the positive or nega- 
show category. The partitions are 
ea n in Table 1. Capital letters are 
Ssigned arbitrarily to the 14 map- 
“im of the stimuli onto the response 
gutes wath +” indicating those 
ud rus Um are positive instances 
сае 8 indicating those which аге 

gative, 
wee number of stimulus patterns 
вес „аге instances of each contin- 
Ше 5. obviously, a function of both 
sional mensional (x) and intradimen- 
of in variability. Manipulation 
affect aum variability does not 
Within € proportions of patterns 
change each contingency, whereas a 
clearly x intradimensional variability 
ig ih ш» The pattern sets result- 
thres-yat ne cases of two-value and 
n ue dimensions are presented 


In Table 1. 


Neisser and Weene (1962) have 
shown that there are only 10 different 
rules within this set of 14 mappings. 
This results from the fact that the fol- 
lowing pairs are precisely the same 
except for a change of relevant at- 
tributes: B and C, E and G, H and I, 
and Land N. The 10 remaining map- 
pings fall into 5 complementary pairs 
having the property that any instance 
which is positive under one member 
of the pair is a negative instance un- 
der the other. Table 2 describes these 
complementary pairs both symbolically 


and verbally. 


Levels of Rules 


A symbolic description, using the 
operations of negation, conjunction, 
and disjunction as primitives, suggests 
that the 10 mappings may be further 
broken down into levels based on the 
complexity (or length) of the descrip- 
tive expression. The unidimensional 
(Level I) represents the simplest, fol- 
lowed by those six expressions which 
involve single conjunctions OT dis- 
junctions of both relevant attributes 
(Level II). Finall, the most com- 
plex expressions entail both conjunc- 
tive and disjunctive operations (Level 
III) Neisser and Weene pointed 
out that the successive levels form a 
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TABLE 2 
CONCEPTUAL RULES DESCRIBING PARTITIONS OF A POPULATION _ 
WITH Two FOCAL ATTRIBUTES 
Basic rule Complementary rule 
i Parti- 
Parti- 
i tion bal 
tion Я " Verl 
a Symboli Verbal d Symbolic 
Name descriptions description Name descriptions description 
i Negati R All patterns 

E | Affirmation R ea H Negation R uh are mol 
amples of the e "i the 
E dn concept. 

К | Conjunction RNS All red and D | Alternative RIS All райе 
square pat- denial (RuS] whicl ot red 
terns are ex- amas nO ate 
amples. a. то ев 

А Inclusive RUS All patterns N Joint RIS Ail parteras 

disjunction which are red denial RAS which nor 
j СК п5] her red 
or square or neither ex: 
both are ex- square are 
| amples. amples. 
s 
C | Conditional RS 1а pattern | L | Exclusion> RNS All patterns ed 
[RUS] is red then it whic! square 
must be and not ra 
square to be are ехатр! 
an example. 
F | Biconditional Res . |Redpatterns| J clusive RUS_ All pate red 
C(RAS)U(RNS)j eran plea disjunction (к п8)и nsy3| vl НВ ате bul 
they are not both are ex 
square, amples. 
ы... E ode 


* Symbolic descriptions using only the three 


hierarchical Structure, with the ele- 
ments of Level II expressions being 
the Level I statements and the ele- 
ments of Level III being Level II 
statements. Further they offer the 
hypothesis that the order of structural 
complexity within the System will be 
reflected in an order of difficulty when 
actual concept problems are given to 
human subjects for solution. 

It is clear that the ordering of con- 
cepts in this analysis arises from the 
selection of negation, conjunction, and 
disjunction as basic operations. Hunt 
(1962) has adopted a more elaborate 
System of operators to describe the 
same expressions or conceptual rules ; 
in fact, by reference to textbooks in 
mathematical logic (e.g., Quine, 1958), 
special operators or symbols for all 
but one of the rules of Table 2 can 
be found. There is nothing in Hunt's 


Д three basic operators, 
^ There is no special symbol for exclusion in general use, 


П, U, and negation, are given in brackets. 


system, other than the difference Er 
tween concepts based on one as ka 
posed to two relevant attributes, whic? 
would suggest any ordering of od 
plexity. Cases in which these oper?" 
tors differ from the expressions qu 
vided by Neisser and Weene can is 
seen in Table 2; Level III concep à 
Can be recast within the more pg 
rate system so as to appear of is 
Same order of complexity as Level 
concepts. re- 
Neisser and Weene, however, ity 
Port a study of the relative айтеп 
of attaining concepts at the three ap^ 
implied by their system, and the sis 
sults clearly support the Пура 
that concepts of hierarchically Big m 
levels are more difficult to attain t й, 
those of lower levels. In un 
although there is a general mr 
ment in performance across two 
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cessive concept problems of the same 
type, the difference in difficulty is still 
evident on the second problem. Ex- 
ploring fewer different concept types, 
Hunt and Kreuter (1962) found simi- 
lar results, | 


Rule Learning 


The analysis above immediately 
raises the question of why, from a psy- 
chological point of view, higher level 
Concepts are more difficult to attain. 
Hunt offers an interpretation based on 
the fact that any deterministic rule 
can be expressed as a tree of decisions. 
E. tree specific to any rule is learned 
7 the subject, assumedly through the 
utilization of a general strategy applica- 
ble to all conceptual problems. Theo- 
Tetically, the subject begins by trying 
to solve the problem as a conjunction 
of attributes, but, in doing 50, тау 
P (when a nonconjunctive so- 

ition holds) subproblems which re- 
quire special treatment. The general 
Strategy may be realized in a com- 
1" program which yields results 
к е te P order of concept difficulty 
Ben in real data (Hunt & 
n Ws 1962). Neisser and Weene, 
irer e other hand, lean toward an 
age based on the hierarchical 
the mauus of concepts, which pictures 
© end as working up from simple 

А Brie арыу groupings of stimuli. 

eel ire y correctly an instance of a 

Bel I concept, the subject must have 
ен, concepts available. as com- 

on Similarly, to utilize Level 
with T Pts, the subject must be familiar 
Bins -evel T components. Thus to at- 
ee eri sek concept, the subject must 
сары = therefore have attained con- 
tion inn a Byer level. This interpreta- 
ёз 15 165 that the subjects do not 
rather s el TIT concepts as such, but 

eir m e or induce them from 
"ponent parts. 
Were аге at least two other con- 
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siderations relevant to the available 
data, neither of which is necessarily 
incompatible with the aforementioned 
interpretations of the subjects be- 
havior. The first is suggested by an 
informational analysis of problems, 
which provides one means of measur- 
ing the complexity of decision trees 
corresponding to different rules. 
Level I concepts have one relevant 
dimension, whereas Level II and III 
concepts have two; number of rele- 
vant dimensions (or bits of relevant 
information) has been shown to be a 
powerful determiner of task difficulty 
(e.g, Bulgarella & Archer, 1962). 
The greater difficulty of Level III as 
compared to Level II concepts may 
result from differences in the map- 
ping of the four stimulus contingen- 
cies as shown in Table 1 (ie, TT, 
TF, FT, and FF) onto the two re- 
sponse categories. All Level II con- 
cepts involve a 3:1 split of the con- 
tingencies, whereas Level ITI concepts 
are based on a 2:2 split. In a sense, 
there is more uncertainty in the stimu- 
lus-category system of Level IIT con- 
cepts (Garner, 1962). This uncer- 
tainty is reflected in the lack of 
homogeneity or commonality among 
individual stimulus patterns in either 
the positive or negative category of 
Level III concepts. Indeed, each cate- 
gory within Level III concept types 
consists of two subsets whose members 
have, by definition, no relevant attri- 
butes in common. Thus the highly 
efficient strategies based on discovery 
of common attributes (Bruner, Good- 
now, & Austin, 1956) are eliminated, 
and must be replaced by different, and 
perhaps more difficult and less efficient, 
strategies based on multiple contin- 
gencies. This interpretation suggests 
that differences in difficulty among 
concept levels (and perhaps within 
levels as well) result from differences 
in stimulus uncertainty—in effect, from 
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differing information requirements to 
solve the problems. 

It is also possible that the observed 
ordering between rules is little more 
than a reflection of differing familiarity 
with the rules and their appropriate 
strategies. With equal familiarity, con- 
cept forms at Levels II and III may 
be equally difficult to attain. Suggestive 
of this is Wells’ (1963) demonstration 
that the evident preference of naive 
subjects for conjunctive as opposed to 
disjunctive concepts in problems which 
could be solved by either rule (Hunt & 
Hovland, 1960) can be modified by 
preliminary training and familiariza- 
tion with disjunctive concepts. Exten- 
sive training on several mappings of 
Table 1 may produce equal facility with 
each type of concept, thus modifying 
the hierarchy described by Neisser and 
Weene (1962). 


Two Aspects of Conceptual Behavior 


With the exception of that of Conant 
and Trabasso (1964), each of the stud- 
ies of comparative rule difficulty dis- 
cussed above has required the subject 
both (a) to identify the relevant attri- 
butes, and (b) to learn the assignment 
of levels on relevant dimensions to re- 
sponse categories as prescribed by the 
rule. That is, for the subject each prob- 
lem had two unknowns, This con- 
founding of attribute identification with 
tule learning creates certain difficulties 
in the interpretation of results. For ex- 
ample, it has not been established that 
the rules differ in difficulty in the ab- 
sence of the necessity to identify rele- 
vant attributes. Clearly, the strategies 
appropriate for identifying relevant at- 
tributes differ among rules, and it may 
be that those which are useful in a 
conjunctive problem are easier to learn 
or employ than those associated with a 
biconditional. "This possibility is sup- 
ported by the finding of Conant and Tra- 
basso that naive subjects learn to use a 


positive-focus strategy in a conjunctive 
problem earlier than they learn to use 
a negative focus in a disjunctive prob- 
lem. However, it is possible that at- 
tribute identification under different 
conceptual rules might not differ if 
adequate instructions and training on 
the relevant rule were given. | 

The foregoing discussion suggests 
that, paralleling the analysis of concep- 
tual problems into rules and attributes, 
there is a distinction between two in- 
dependent kinds of behavior: (a) rule 
learning, in which the subject attains or 
discovers the principle for pane 
the stimuli in a particular problem an 
acquires the rule in general form A 
that he can use it in any problem ; anc 
(5) attribute identification, in which sie 
subject attains or discovers the relevant 
attributes. One major purpose of ne 
Paper is to provide some experimenta 
evidence on the validity of this a 
to demonstrate the separability of ue 
two forms of behavior, and to indicate 
the unique features of each. 


EXPERIMENT I 


In the first experiment, the subject 
were presented with a series of бе 
Problems, each based on the same 
but with a different pair of relevant 2 
tributes. Each subject served in one Mw 
three main conditions which are labele? 
rule learning (RL), attribute identifica 
tion (AI), and complete learning (CL = 
Та the RL condition, the subject у-и 
given the names of the two relevant а 
tributes at the start of each pn 
His task was to learn the proper ion 
ment of a set of stimulus patterns зе 
the response classes according to ен 
unknown but preselected rule ШАП, 
ing these attributes. In the АТ m 
one rule was explained and шиге, 
for the subject at the outset, leaving d 
the task of identifying the pei 
relevant to the concept in each prob ‘the 
This case represents in essence 
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experimental procedure used in earlier 
studies of concept identification. In the 
CL case, the subject was told neither 
the rule nor the attributes at the outset, 
but had to attack each problem given 
only a description of the stimulus popu- 
lation and the size of the concept (two 
relevant attributes). 

One specific purpose for separating 
out RL was to determine whether 
the rules are intrinsically different 
in difficulty when the subject is not 
Tequired to identify relevant attributes. 

f so, then it can be expected that, at 
least on the initial problems, different 
Tules will yield different numbers of 
errors and trials to solution. In con- 
trast, the AT condition (which provides 
thorough preliminary instruction on 
the rules) was designed to assess the 
relative difficulty of identifying rele- 
vant attributes under different rules. 

hereas expectations regarding RL 
се hg clear, there is some evidence in 
that ae to support the expectation 
problems will differ in difficulty 

ven though rule instruction is given. 
i Goodnow, and Austin (1956) 
аи eo the utility of a pow- 
P holist strategy for identifying the 

"ii attributes of a conjunction. 
bir m allows the subject to form 
ris à composite hypothesis which 
беке T on any trial all tenable solu- 
= ae Ка with this strategy there is 
Pob or the subject to consider in- 
не: на all possible pairs of rele- 
bed: ributes, Such a strategy is, in 
эз = not appropriate to the disjunc- 
studied to any of the other rules to be 
iss i in this experiment. The strate- 
Tiles hich are available for these other 
Soke no convenient ‚ау of 
Simul а, er hypothesis which 
Solutions usly represents all possible 
irs CL condition was included for 
mdr “asons, First, it provides con- 

У with the experiments of Neis- 
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ser and Weene (1962), and of Hunt 
and Kreuter (1962). Second, CL pro- 
vides a base line for comparison to AI 
and RL to determine the effectiveness 
of instructions that, in one case, pro- 
vide the relevant attributes and, in the 
other, the relevant rule. If knowledge 
of either feature is important to concept 
learning, performance on even the first 
problem should show significant dif- 
ferences between CL and the other 
two conditions. 

To determine the effects of prac- 
tice, a series of five problems having 
the same solution form was given in 
all conditions. | Assuming the effec- 
tiveness of instructions, improvement 
across successive problems could be at- 
tributed to the following factors. In 
RL, to an increasing familiarity with 
the way a rule assigns stimulus patterns 
to the response categories; in AT, to the 
development and utilization of strate- 
gies appropriate to the identification of 
relevant attributes under a given rule; 
and, in CL, to both factors. 


Method 


Subjects and Design. Sixty students from 
introductory psychology classes participated 
individually as subjects and were assigned 
in order of appearance to one of 12 treatment 
combinations. All subjects were naive in 
that they had not previously participated in 
a concept-learning experiment. 

The experimental design was a 4x3 re- 
peated-measures factorial, incorporating con- 
cept problems generated by four different 
rules and three instructional conditions. The 
rules used, viz., conjunction, inclusive dis- 
junction, joint denial, and conditional, were 
selected arbitrarily to include (a) only Level 
II types (Neisser & Weene, 1962) and 
(b) one pair of complements. The three 
instructional conditions were AI: the rule 
and all stimulus dimensions were described 
and illustrated, but the relevant attributes 
were unknown to the subject; RL: all the 
stimulus dimensions and the relevant attri- 
butes, stated in TT form, e.g. red square, 
were given, but the rule was not explained ; 
and CL: all dimensions, but neither the rule 
nor the relevant attributes were described. 
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Each subject solved five successive prob- 
lems utilizing the same rule within the same 
instructional condition. A different pair of 
relevant attributes was used in each problem, 
and five different sequences of pairs were 
used to avoid confounding with ordinal 
position. 

Materials and Apparatus. The stimulus 
patterns were geometric designs, prepared 
on 4X 6 inch cards and varying along four 
three-value dimensions. Dimensions and 
their values were: number (one, two, and 
three figures), size (large, medium, and 
small), form (square, triangle, and hexa- 
gon), and color (red, yellow, and blue). 
From Table 1 it is clear that the numbers 
of positive and negative instances within 
this population of 81 stimulus patterns vary 
markedly with the rule. In studies using 
only a single rule, the proportion of positive 
instances has been shown to be a significant 
determiner of rate of learning, even when 
the stimulus sequences have been equated for 
information content (Hovland & Weiss, 
1953). To avoid confounding rule differ- 
ences with this factor, extra cards were in- 
serted in the basic deck when necessary to 
insure roughly equal numbers of positive 
and negative instances. 

A sample card showing all possible values 
of the dimensions was left exposed for the 
subject's reference at all times. Special cards 
showing Venn diagrams of the conceptual 
rules to be used in the experiment (rule 
cards), and cards listing the pairs of relevant 
attributes to be used (attribute cards) were 
prepared. The subject sat at an ordinary 
desk facing the experimenter. Presentation 
of stimuli and recording of responses were 
performed manually. 

Task and Procedure. The subject was re- 
quired to sort, or classify, a series of visually 
presented stimulus patterns into two cate- 
gories. The correct classification for any 
stimulus was determined both by the pair 
of attributes relevant in the particular prob- 
lem and by the conceptual rule which speci- 
fied the relationship between these attributes, 
In each problem two dimensions were ir- 
relevant to solution. 

At the outset, all subjects were given de- 
tailed oral instructions describing the stimu- 
lus population and the task. They were told 
that stimulus patterns would be presented 
one at a time and were to be sorted into two 
categories, viz, those that were and those 
that were not examples of the concept. 
Emphasis was placed on finding the two 
attributes which characterized the class of 
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positive instances; despite this emphasis on 
the positive category, the subjects were 
warned that both the presence and absence 
of particular attributes might be important. 
Instructions given to all groups were identi- 
cal except for those portions restricted by 
the nature of the experimental conditions. 
The subjects were told that there would 
be five problems, each utilizing the same rule 
but each with a different pair of relevant 
attributes. Prior to the first problem, the 
subjects in the AI condition were given the 
appropriate rule card for their problem 
Series. Before each problem, the subjects 
in the RL condition were given an attribute 
card listing the pair of rclevant attributes. 
The attributes were read to the subject in 
the order in which they appeared on the 
attribute card. However, no emphasis was 
Placed on the fact that one came first, OF 
that the order in which the attributes ар" 
peared might be related to problem solution 
The subjects in the CL condition were told 
only that two attributes would be relevant 
and that these would be combined by some 
kind of rule. 

To each successive stimulus card, the sub: 
ject was required to respond “yes” or ni 
thus indicating whether it was an example 0 
the concept ог not. Following the subject $ 
Tesponse, the experimenter provided feed- 
back by saying “right” or “wrong” and by 
Placing the stimulus card face up on the 
desk in an area marked YES or NO zi 
appropriate. All problems were selí-pace 
in that the subject was allowed as much time 
aS desired to respond, and instructions 
Stressed accuracy rather than speed. 6 

The criterion of problem solution was y 
consecutive correct responses. When E 
problem was completed, the subject was T€ 
minded that the next problem would involve 
the same rule but a new pair of relevan 
attributes, and those in the RL condition 
Were given a new attribute card. Program- 
ing of card sequences was randomized 


= 1 
shuffling the deck several times before cac 
problem. 


Results and Discussion 


The mean numbers of errors and 
trials to solution observed for all five 
Problems in each of the 12 main condi- 
tions of the experiment are reported 1? 
Table 3. Overall analysis of variance 
on errors demonstrated four statistic- 
ally reliable effects, First, for all co" 
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cept types, CL, wherein both the rule 
and the relevant attributes were un- 
known, was more difficult than AI, 
which in turn yielded a greater number 
of errors than RL (Е = 19.34, df= 
2/48, р <.01). Second, there was 
steady improvement in performance 
over problems (F = 13.15, df = 4/152, 
Р <.01) and no evidence of interaction 
between problems and instructional 
conditions. Third, rules differed gen- 
erally in difficulty (F — 12.24, df — 
3/48, p <.01) and according to the 
order observed by Hunt and Kreuter 
(1962). Finally, the interaction of 
rules and successive problems was re- 
liable (F = 2.91, df = 12/152, р <.01) 
reflecting the fact that the difficulty of 
joint denial, disjunction, and condi- 
tional concepts changed markedly in 
Comparison to conjunctive concepts. 
Identical results held for the analysis 
of trials to solution. 

Further inspection and analyses of 
the data in Table 3 revealed that, in 
RI. conditions, essentially perfect per- 
formance was reached by all subjects 
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on the fourth problem on all rules ex- 
cept the conditional. Initial expecta- 
tion of and familiarity with the con- 
junctive rule was clearly evidenced by 
the fact that only one subject made any 
errors on these RL problems, and he 
made these only on two of the five 
problems. Fractional mean errors on 
later disjunction and joint denial prob- 
lems resulted, in both cases, from a 
single subject making one or two 
errors per problem. Complete mastery 
of these three rules is demonstrated by 
the immediate solution of each problem 
by the subjects upon being provided 
with the relevant attributes. 

The overall greater difficulty of the 
conditional rule appeared to result 
from two sources. First, the size and 
nonhomogeneity of the class of positive 
instances: inspection of Table 1 shows 
that the conditional rule classifies a 
greater number of instances as positive 
than any other and moreover uniquely 
requires both TT and FF (e.g., red 
square and not red, not square) to be 
grouped together. One implication of 


TABLE 3 


MEAN Number or Errors (E) AND TRIALS (T) TO SoLUTION IN EXPERIMENT 1 


Problem 
Rule bd Е 7 | " i 3 
condition Е 
Е T E T E T E T E TY 
ond AI зо | 134| за |120 | 16| 16 | a4 155] 22) 74 
Onjunction | КІ, E 2| a| 410 | o | o {о о |? 
CL 36 | 1401 30 | o4| 32] 118] 40] 162) 20) 76 
Dis; AI 12.1 | 328| 76|320| 40 |180 | 4.2 9.8 42 p 
"Sjuncion | КІ, | азер do 42) a| BO | rh] су 
QU | зто 1120.81 17.0 | 524 | 22.2 | 75.6 | 11.0 | 38.8 | 13:0 49.2 
Joi AI o2| 32.6] 114 | 356 | 64 |215 | 86|270| 34 166 
Ош denial | RL 86 | 2861 34 11181 16] 46] 0 | 0 A44 
GL |70 | 670 120 | 49:4 | 162 | 62.6 | 18.6 | 67.0 | 172 65.6 
5 sis | 90 | 34.0 
eth ! 96.8 | 494 | 12.2 | 47.8 | 12.6 | 510 | 13.8 | 54.5 | 9 
Conditional RI 164 | 57.5 iss | 68 | 23.5 | 76) 230) 64 | 305 
І “Т лез 11-4 | 278 | 11-2 | 30.6 | 20.6 | 60.6 | 110 37.6 
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this is that focusing on the negative 
category (Hunt, 1962), which was not 
encouraged by the instructions of this 
experiment, would facilitate the learn- 
ing of conditional concepts. Second, 
the asymmetry of the rule: whereas 
the partition generated by any of the 
other three rules is invariant with com- 
mutation of attributes, e.g., red and/or 
square is identical to square and/or 
red, only one of the two possible at- 
tribute orderings is correct for a con- 
ditional problem. The effect is to place 
special emphasis on the difference be- 
tween TF and FT instances. Because 
the order of attributes was not stressed 
in the instructions, the subject was 
posed the additional task of making 
this basic distinction. Additional evi- 
dence on the difficulty of this task is 
provided by Experiment IT. 

Once the subject has learned the rule, 
it is obviously possible to categorize 
patterns without error in any RI con- 
dition. In contrast, AT requires, on 
logical grounds, the presentation of a 
certain minimal number of instances 
since the subject must discover which 
dimensions and values are relevant, 
Even the most efficient search strategy 
will be accompanied by a number of 
errors commensurate with the number 
of trials necessary to eliminate all but 
the correct pairing of attributes. 

Although the specific patterns shown 
to each subject differed and were not 
recorded, it is possible to compute the 
expected minimal number of trials re- 
quired for problem solution under the 
AI condition for any rule. These 
minima have been computed for the 
stimulus population used in the pres- 
ent experiment under the following 
assumptions: (a) the subject has limit- 
less memory, i.e., never forgets infor- 
mation previously presented, and (2) 
the subject tests possible hypotheses 
using a strategy equivalent to the 
“simultaneous scanning” technique 
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(Bruner et al., 1956). The computed 
values are 4.7, 10.8, 9.7, and 8.0 for 
conjunction, inclusive disjunction, joint 
denial, and conditional, respectively. 
Inspection of Table 3 shows that this 
theoretical minimum is not attained for 
any rule, although it is approached for 
conjunctive problems, It is clear, both 
from the present data and from those 
of Bruner et al. that the subjects do 
not use simultaneous scanning. Any 
strategy adopted by the subject will 
require a greater number of trials than 
a maximally efficient scanning process, 
and the minimal number of trials will 
depend on the strategy actually used 
hy the subject, . 
While it is impossible to determine 
exactly what strategies, if any, were 
used by the subjects, inspection of the 
crror data suggests that the subjects 
were not responding in accord with 
randomly selected binary hypotheses 
Prior to solution, for such a strategy 
would give an error total equal to about 
half the number of trials. Instead, the 
actual mean errors approaches the 
minimum commensurate with the theo- 
retical minimum trials for all rules ех- 
cept conditional. In other words, it 
would appear that error performance 
in conjunction, inclusive disjunction, 
and joint denial was about as efficient 
at the end of five problems as it сап 
be. The small ratio of errors to trials 
Suggests that many subjects solve the 
problem in parts, for example, by a: 
tially discovering one but not both а 
the relevant attributes (Trabasso © 
Bower, 1964). А 
The overall difficulty of the condi- 
tional rule is reflected in performance 
within the AT condition, Numbers 9 
trials and errors are greater over à 
five problems than for any other rule. 
The failure of the subjects to obtain 
conditional rule mastery in five pet 
secutive RL problems suggests n 
part of the difficulty within the 


ASPECTS OF CONCEPTUAL BEHAVIOR 185 


condition results from a lack of under- 
standing of the rule itself. 

_Except for conjunctive concepts, CL 
yields more errors and requires more 
trials than AT, even for the fifth prob- 
lem, This result demonstrates the ef- 
fectiveness of prior instructions about 
the rule, and provides further evidence 
that both RL and AI are involved in 
the solution of conceptual problems. 
The improvement in CL performance 
15 consistent with the findings of 
Neisser and Weene (1962). It is in- 
dicative of the possibility that, with 
further training, the subjects will be- 
come as facile with problems of a non- 
conjunctive nature as they are with 
conjunctive problems at the outset— 
within the constraints of the amount of 
information required for problem solu- 
tion, However, failure of the CL 
Sroups, in general, to attain the same 
level of performance as AI groups after 
"mue problems indicates that rule 
Loe proceeds more slowly when 
the subject must, in the same problem, 
identify relevant stimulus attributes. 
Clearly, then, CL is more than the 
Simple sum of AI and RL. 

At least after the first problem, per- 
Ff eus on the disjunctive and joint 
жыз prin is virtually identical, as 
they a e suggested by the fact that 
e e complementaries. The differ- 
is etween these concepts on the 
doce which is not statistically 
initial oe may be attributable to the 
Get effect of instructions emphasiz- 
the ne class of positive instances, but 

evidence is far from compelling. 


Experiment II 


No condition of Experiment I 
Be i training for the subject in 
=a Pp- np of , Stimulus patterns 
Spine m bute contingencies) to re- 
the snt categories by a single rule. If 
Ss a can be trained to perform 

ately on two or more rules, it 


becomes feasible to present to him a 
conceptual task which may be labeled, 
for sake of consistency, rule identifica- 
tion (RI). In RI the subject is given 
the relevant attributes and is required 
to identify which of the known rules 
determines problem solution. The 
task, obviously, is designed as an ana- 
logue of AT problems, wherein the sub- 
ject must identify which of several 
known attributes determines solution 
under a given rule. From Table 1 it is 
clear that the assignment of one or 
more of the four contingencies of rele- 
vant attributes will differentiate any 
two rules, and that presentation of all 
four contingencies will determine 
uniquely which rule is correct. Thus 
the subject’s performance may be com- 
pared to an efficient algorithm, which 
solves such RI problems in four trials 
if each trial provides an example of 
a different contingency of the relevant 
attributes. In general, a certain mini- 
mal number of instances, positive or 
negative, is required to eliminate all 
but the correct solution for any RI 
problem, just as in AT. Assuming 
prior familiarization and equalization of 
rules through RL, however, no differ- 
ence in rule difficulty would be ex- 
pected. 

The purpose of Experiment П is to 
investigate RI performance after prior 
RL training on four different rules 
selected to represent both Levels II 
and III from the Neisser and Weene 
(1962) hierarchy and a high degree of 
diversity in the way they assign stimu- 
lus patterns to categories. 


Method 


Subjects and Design. Twenty-four stu- 
dents from introductory psychology classes 
served as subjects and were assigned in 
order of appearance to six training groups. 
None of the subjects had participated in 
Experiment 1; however, approximately 
three-fourths had participated in other con- 
cept-learning experiments, involving the 
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identification of attributes in unidimensional 
or conjunctive concepts. Because of task 
differences, and because these subjects were 
distributed without bias over experimental 
treatments, prior experience was assumed to 
have no effect on the present results other 
than possibly an overall improvement in 
the general level of performance. 

The experiment consisted of two parts, an 
RL and an RI phase. In the RL phase, the 
subjects were presented with a series of con- 
cept problems based on three different rules, 
viz, disjunctive, conditional, and bicondi- 
tional. The six possible sequences of rules 
were used, each for an equal number of 
subjects. Successive problems utilized dif- 
ferent pairs of relevant attributes, which 
were named for the subject before each. 
The purpose of this phase was to train the 
subjects in the use of these three conceptual 
rules, plus a fourth, the conjunction, which 
was explained in detail and illustrated during 
initial instructions. 

For the RI phase, a balanced incomplete- 
blocks design, with four rules, two pairs of 
relevant attributes, and two successive prob- 
lems, was used. The two problems were 
based on two of the four rules on which 
the subject was trained during the RL phase. 
The two pairs of relevant attributes, which 
were named for the subject at the outset 
of each problem, were counterbalanced with 
problem order and with relevant rule. 

Materials and Apparatus. The stimulus 
patterns were the same as those used in 
Experiment I. Sequences, however, were 
Prepared on filmstrips, and the form of the 
subjects’ responses changed from verbal 
statements to pressing buttons, labeled “yes” 
and “no,” on a control panel. The apparatus 
was the same as that used in previous con- 
cept-identification studies (e.g, Bourne & 
Haygood, 1959). It was programed to 
present an immediate informative feedback 
signal of l-second duration to the subject 
after each of his responses. The total time 
between the subject’s response and the ap- 
pearance of the next stimulus pattern was 
fixed at 5 seconds. 

In preparing stripfilms, several constraints 
were imposed on the sequences of stimulus 
patterns, in addition to the equation of num- 
ber of positive and negative instances. First, 
and most important, the regular occurrence 
of a representative of each of the four at- 
tribute contingencies shown in Table 1 was 
provided. Particular attention was paid to 
insuring that instances critical to the distinc- 
tions between the four rules occurred at least 
once during any sequence of 16 patterns (the 
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length of the criterion run). Second, runs of 
the same category (yes or no) were limited 
to five. Third, where patterns were repeated 
because of using extra cards to equate posi- 
tive and negative instances, care was taken 
that no two consecutive patterns were identi- 
cal. Fourth, each problem began with a 
pattern containing both relevant attributes, 
ie, the TT attribute contingency. В 

Task and Procedure. The task and in- 
structions were essentially those used in the 
RL condition of Experiment I, with the 
exceptions that (a) equal emphasis was 
placed on observing instances of both the 
positive and negative categories and (b) an 
elaborate explanation of each rule was given 
after performance on its associated prob- 
lem. Detailed instructions were given the 
subject concerning the stimulus dimensions 
to be employed in the problems, the opera- 
tion of the apparatus, and the nature of con- 
cepts and conceptual problems. During the 
instructions, the conjunctive rule was €x- 
plained and used as an example, and the sub- 
ject was given the conjunction rule card. 
The subject was told that there would be 
three training problems, each employing а 
different rule, and that subsequently there 
would be two problems in which the solu- 
tion could be any one of the four rules. 

The stimulus patterns were projected one 
at a time on the screen before the subject. 
To categorize a pattern, the subject re- 
sponded by pressing either the yes button 
or the no button on the control panel below 
the screen. The correct response was then 
indicated by the illumination of a signal lamP 
above the proper button. АИ problems 
were self-paced, the subject being allowed a5 
much time as needed to make any response: 
The criterion of problem solution was. 
consecutive correct responses. Any subject 
failing to solve the first problem in 45 d 
utes was arbitrarily deemed a nonsolver an 
dismissed from the experiment. Two sub- 
jects were discarded under this criterion , 
One other subject was eliminated for being 
unable to solve the third problem and bent 
unwilling to continue in the experimen 
Three other subjects were eliminated fo 
reasons unconnected with the experimenta 
treatment, such as apparatus malfunction. 

Following solution of the first problem, 
subject was given the proper rule card an ^ 
thorough explanation and further examp- 
of the rule. When the three RL problems 
were completed, the subject was reminded = 
all four rules and that the next two Pre а 
lems would each utilize one of the fou 
rules he had just learned. 


the 
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AT е x 
Results and Discussion 


Mean numbers of errors and trials 
to solution of all problems are given 
in Table 4. Analyses of variance on 
the two dependent response measures 
showed identical outcomes; therefore, 
only significance tests based on errors 
are reported. During the RL phase, 
sources of variance identified with rules 
(F = 9.19, df = 2/40), ordinal posi- 
tion of problems (F = 9.15, df= 
2/40), and their interaction (F = 6.08, 
df = 4/40) were statistically significant 
(р <.01). The initial order of diffi- 
culty among rules is precisely the same 
as that observed by Neisser and Weene 
(1962) and by Hunt and Kreuter 
(1962); these differences, however, 
are attenuated on the latter two RL 
problems. Because, in this phase, only 
one working problem of each type was 
given to each subject, ordinal position 
and interaction effects are attributable 
to à general transfer among rules, par- 
ticularly from other rules to the bicon- 
ditional, rather than (as in Experiment 
1) a specific process of learning how to 
ee problems characterized by a single 
rule, 


Number of trials to present at least 
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one stimulus pattern representative of 
each type of attribute contingency sets 
one logical limit on the minimum infor- 
mation necessary to categorize all pat- 
terns correctly. These values, com- 
puted directly from the stimulus se- 
quences used, are presented in Table 4. 
A comparison of trials to solution with 
this base line indicates that, with the 
possible exception of later disjunctive 
problems, the subjects do not solve 
problems with maximal efficiency dur- 
ing the RL phase. This is hardly un- 
expected in view of the training routine 
employed and the relative unfamiliarity 
of these rules at the outset. 

There are significant differences in 
difficulty among conceptual rules in the 
RI phase (F = 6.39, df = 3/18, p 
< 01) with biconditional and condi- 
tional problems producing a greater 
number of errors than the conjunctive 
and disjunctive. The reliability of 
these differences is enhanced by the 
limited variability in performance on 
conjunctive and disjunctive problems, 
wherein solutions were attained in a 
minimal number of trials in 75% and 
83% of cases, respectively. Identifica- 
tion of the conditional and bicondi- 
tional rules was accomplished in a 


TABLE 4 


< NuMBER ок TRIALS (TS) AND Errors (E) TO SOLUTION AND NuMBER OF TRIALS 


NECESSARY TO PRESENT AT LEAST ONE REPRESENTATION OF EACH ATTRIBUTE 
COMBINATION (TP) 1х ALL PROBLEMS, EXPERIMENT II 


Rule 


Conjunction 
disjunction 
Conditional 
conditional 


Problem 
Rule learning | Rule identification 
| 

1 Э, | 3 | 4 5 
TS m E | TS be E | TS | E | TS | TP 

| — - | 
== [== e | 8.0) 8 3.2 | 5815 
54 | 4} 5 | ъз | 4215) 2:3 | 72] 3 
63 |155|9 | 3.5 | 8.817 68 | 16.3) 6 
86 127215 | 8.5 | 30.2] 9 | 62 | 278] 5 


ote.—There were no conjunctive problems given duri 


ing the RL phase. 
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TABLE 5 
MEAN NUMBER OF TOTAL PRESENTATIONS (TP), PRESENTATIONS 10 Last Error (NP), 
MEE Errors (E), AND RATIO or ERRORS TO PRE TATIONS (E/NP) 
FOR Four ATTRIBUTE COMBINATIONS, EXPERIMENT II 
Disjunction Conditional Biconditional | Conjunction 
Attribute = ] 7 7 5 | T 
i E (ice TP) NP] E | EE | re | N | E EL| TP | NP | E | NP 
ШК. | = zo. ÁN RE, 
m Rule learning, Problems 1-3 ER 
= - m Т 
TT 1.5 -4 | -3| .80 | 3.0 9) .6| 63| 43 2.8| 1.0 36 | 
TF 2.3 | 1.4 |1.2| .88 | 9.2 | 5.2 | 2.6 -53 | 80| 6.4] 3.5 Б 
ЕТ 3.3 | 2-0 |1.3| -65 | 3:1 | 1.3 | 1.1 -87| 83| 5.9| 3.3 1.57 | 
FF 4.9 | 2.5 | .9|.36 | 6.6 | 6.3 | 4.1 -65 | 15.8 |11.3 | 6.3 |.56 
Rule identification, Problems 4-5 
| | ei | | Т | 
TT EDNO 10.) — lie! а aues az] s oa |.43| 30| .6].4 * 
TE |10) 95.5.36 6.0) 40 | L8| 45| 57] 82/24 [46 13 | 12] 973 
FT 4|13]|.8/.61|15| 12] 9 45| 7:9] 3:8] 2.1 :98 | 11 9 |.6 40 
FF 2.3 | 1.7 | .6/.35 | 37 | 3. 2.2 | .65|11.7| 50 2.5: [:50 | 1.5: 1.51.61 
| zx \ ЖЕЛ | 


minimal number of trials by only 33% 
and 25% of the subjects, respectively, 

As in Experiment I, errors were 
made on fewer than 50% of the trials, 
as is shown in Table 4. To determine 
the correctness of an assumption that 
this outcome resulted from learning the 
assignment to response categories of 
certain contingencies more rapidly than 
others, problem-by-problem analyses 
were made of (a) the number of pres- 
entations of each type of attribute con- 
tingency prior to the first trial of the 
criterion run, (b) the number of pres- 
entations of each type of attribute con- 
tingency prior to the last error on an 
instance of that type, and (c) the 
number of errors made on each type 
of attribute contingency. The results 
of those computations are shown in 
Table 5 for the three RL and the two 
RI problems separately. Although 
there are variations across problems 
and across attribute contingencies 
within problems, the proportion of er- 
rors on trials up to and including the 
last error trial computed separately for 


each contingency (the ratio of c to Ё 
above) averages .56 in RL problems 
and .53 in RI problems, Thus, pat- 
terns apparently were assigned to cate- 
gories with chance probability of being 
correct until the assignment was 
learned, after which no further errors 
were made. r 
Variation among attribute contin- 
gencies in number of trials to last erro" 
reveals some sources of rule difficulty- 
The scores identified with the TT com- 
bination may be discounted because 
(a) an instance of this type was age 
first presented in every problem, a үт 
commonly discovered and utilized д 
the subjects, and (b) the instance 2 
positive for all rules, thus not one whic | 
distinguished among them. Difference? 
among the three remaining continge” 
cies were not great for disjunct? 
problems in the RL phase. On the - 
erage, the subject required presentat! xi 
of about twice as many of each type * 
was necessary on logical grounds. TF 
conditional rule problems, however; of 
and FF required a greater number 
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presentations to last error than did FT 
(F = 3.71, df = 2/42, p < .01). Both 
instances are critical to the disjunction- 
conditional distinction, their assign- 
ment to categories being reversed un- 
der the two rules; and the difference 
may be a reflection of the greater ease 
of learning or familiarity with disjunc- 
tions. Note also that any strategy 
which emphasizes the search for com- 
mon attributes among instances of the 
positive class encounters difficulty with 
the conditional, which assigns patterns 
of the TT and FF types to the positive 
category. This is apparent in per- 
formance on biconditional problems, 
also, wherein the largest number of 
Presentations to final error was asso- 
ciated with FF instances (F = 3.07, 
df = 2/42, p « .10) which, as in con- 
ditional problems, are assigned along 
With TT instances to the positive class. 
In this rule, of course, any instance 
Tepresenting one but not the other 
relevant attribute (TF, FT) must be 
Categorized as a negative instance. 

€arning the assignment of these in- 
Stances is apparently more difficult un- 
der the biconditional than under any 
Other rule (Е = 3.25, df = 2/110, р 
< .05). 

A similar analysis of RI problems is 
Presented in Table 5. Except for over- 
E рне „performance, the attribute 
dif Ingencies are aligned in order of 
"ai much as they were in RL 
а, ems. Mean number of presenta- 
e S 1s close to 1.0, and the error rate 
wita айе, :5 for all contingencies 
len the conjunctive and disjunctive 
os Obviously these rules were 
" T etely mastered either during RL 
of ee preexperimental experiences 
rd Subjects. Only 5 of 24 subjects 
fate more than the minimal number of 
ee Tequired by the algorithm. The 
tetas i Strongly suggests that the sub- 
reduc; ave acquired (a) the technique of 

Ing the population of stimulus pat- 
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terns into attribute contingencies and 
(b) the mapping of contingencies onto 
the response system associated with 
both the conjunctive and disjunctive 
rules. 

Three to four presentations of TF 
and FF instances for the conditional 
rule and four to five presentations of 
TF, FT, and FF instances for the bi- 
conditional were required on the aver- 
age before these RI problems were 
solved and the rule satisfactorily iden- 
tified. Incomplete mastery of these 
rules appears to be the most prosaic 
explanation of poorer performance. 
While there is strong evidence for im- 
provement on these rules, only 4 of 12 
subjects for the conditional and 3 of 12 
subjects for the biconditional solved in 
minimal trials. The fact that 6 of the 7 
subjects performing optimally on con- 
ditional and biconditional problems 
solved their other RI problem in mini- 
mal trials suggests that they have 
learned to use, at least intuitively, the 
principle for collapsing the stimulus 
population to attribute contingencies 
and that the difficulty of conditional 
and biconditional problems stems pri- 
marily from a failure to learn com- 
pletely the mapping of contingencies to 
responses necessitated by these rules. 


GENERAL Discussion 


The preliminary analysis outlined in 
this paper discloses two aspects of con- 
ceptual problems as they are commonly 
posed for subjects in the laboratory, viz., 
the learning of rules and attributes. 
Research to date has been concerned 
largely either with (a) the processes by 
which subjects, given the correct rule, 
discover or identify relevant attributes 
of a concept, or (b) performance in sit- 
uations wherein both the relevant attri- 
butes and the rule are unknowns. The 
present experiments considered the 
third case, attributes given and rule 
unknown, for purposes of logical and 
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empirical comparison with the first 
two. Primarily, this study provided 
a comparison of performance charac- 
teristics in RL and AI problems; its 
aim was to determine the advantages 
to be gained, if any, from a separation 
of attribute and rule aspects in studies 
of conceptual behavior. Secondarily, 
interest centered on the question of 
differences in difficulty among rules, 
and the possibility that these differ- 
ences may be attenuated through 
training. 


Rule Learning 


In Experiment I the observed im- 
provement across a series of RL prob- 
lems demonstrates that through train- 
ing the subject does acquire knowledge 
about how a rule assigns stimulus pat- 
terns to response categories. At least 
for the rules which are initially least 
difficult, conjunction, inclusive disjunc- 
tion, and joint denial, it has been shown 
that RL proceeds regularly to a level of 
mastery such that, given the relevant 
attributes, the subjects categorize stim- 
ulus patterns essentially without error. 

Training on a single rule bears ob- 
vious similarity to the regimen of 
learning-set studies (Harlow, 1949). 
Solving each of a series of individual 
problems yields continuous improve- 
ment, producing optimal performance 
on a given type of task. In the present 
situation, where some rules are familiar 
and relatively easy at the outset, while 
others—because of lack of training or 
interfering expectations—are relatively 
difficult, this procedure operates to re- 
duce or eliminate initial differences in 
difficulty. It may also be noted that 
the process called RL is not unlike 
concept formation, as that phrase is 
used by Piaget (1929; see also Flavell, 
1963). When one of Piaget's children 
learns the concept of conservation of 

quantity, he has acquired, in effect, a 
general rule or way of responding in 


a variety of problematic situations. 
For example, the attainment of con- 
servation is implied when a child indi- 
cates that (a) emptying a container of 
beads into another of different shape 
and capacity does not affect the quan- 
tity of beads, or (b) different spatial 
arrangements of poker chips on a table 
do not entail a change in number of 
chips. In the last analysis, it is not 
the specific attributes, inextricably tied 
to the rule, which the subject acquires 
as the concept, but rather the rule itself. 


Processes in Rule Learning and Iden- 
tification 


Rule learning may be construed as the 
process by which the subject as ri 
information on the assignment of al 
combinations of values on relevant 
stimulus dimensions to response cate- 
gories. In the stimulus population use 
in the present experiments, this means 
a mapping of nine different patterns 
(combinations of three values on each 
of two relevant dimensions) onto ? 
two-response system. Evidence from 
Experiment II, however, argues that 
RL introduces an additional, simpli- 
fying factor or heuristic into the йе 
gorization process. Simplification a 
sults from collapsing the 3 X 3, ! 
general m x n, matrix (representing 
combinations of values on relevant a 
mensions) to a 2 x 2 matrix, each ce 
of which contains a set of patterns; 
called here contingencies, characterizes 
by the presence or absence of the tW 
relevant attributes. To say, then, tha 
the subject has learned a rule is to в і 
that he understands how it unide 
assigns contingencies to response C? 
gories, А ho 

It is conceivable that a subject bre 
achieves, with practice, the capacity. _ 
Teact to all instances of each gonnie 
gency as members of the same C? 


; . : ing 
might derive an algorithm for learn"? 


"EP jon 
à new rule or for attaining solutio 
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any RI problem: Observe the assign- 
ment of one example of each of the 
four contingencies, then classify all fu- 
ture instances according to this scheme. 
This plan reduces a potentially unlim- 
ited population of instances to four, 
then merely maps these four onto a 
twofold response system. While it is 
probably true that this algorithmic 
process is not carried out formally by 
the subjects, it is not unreasonable to 
assume that many subjects perform a 
similar routine on a more intuitive level. 
. General RL, for at least some sub- 
Jects, may be conceived, then, as a 
two-stage process involving: (а) re- 
ductive coding of a stimulus popula- 
tion to the matrix of attribute contin- 
gencies and (b) acquisition of the 
assignment of contingencies to response 
categories which is unique to each of 
a variety of rules. Neither the data 
пог this analysis indicate that Step а 
is mastered before the subjects learn 
anything about a rule. Both imply only 
that sophisticated and efficient per- 
formance in problems which require 
the learning of a new rule or the iden- 
tification of a familiar one will include 
an intervening step which reduces the 
effective complexity of the stimulus 
Population, Evidence from Experi- 
ment II, however, makes it clear that 
Some subjects used the collapsing or 
Coding heuristic prior to complete learn- 
Ing of the more difficult conditional and 
biconditional solutions. The similarity 
ОЁ the coding principle to the formal 
truth table is obvious. If the fore- 
Soing interpretation is correct, the re- 
Sults imply the utilization, as a media- 
tor, of some informal version of this 
device, which of course is well known 
to have considerable power in logic. 


Rule Difficulty 


oe the subject learns a rule as a spe- 
a nc and unique mapping of contingen- 
i 3 $ > 
es, and uses differences in the assign- 
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ment of contingencies as a means of 
identifying the rule (solution) for any 
RI problem, the implication is that all 
rules which in some way entail the 
combination of two (or any fixed num- 
ber) relevant attributes are equally 
complex or difficult. It is well estab- 
lished that rules for combining two 
relevant attributes are not equally dif- 
ficult at the outset even when the 
attributes are known and probably take 
different amounts of practice to learn. 
However, there is evidence in the pres- 
ent experiments that such differences 
may be reduced through RL and prob- 
ably eliminated. Initial differences may 
be attributable to differing amounts of 
preexperimental experience with the 
various rules used here, and to peculiar, 
unfamiliar, or “unexpected” assign- 
ments of contingencies to response cate- 
gories, such as the assignment of both 
TT and FF instances, which have no 
common relevant attributes, to the 
same response class. It is possible, 
moreover, that variation in the uncer- 
tainty of pattern-to-response or con- 
tingency-to-response assignments, eg., 
Level III versus Level II rules (Neis- 
ser & Weene, 1962), may account for 
differences in initial difficulty. All of 
these sources of differential rule diffi- 
culty in RL apparently may be rendered 
ineffectual through training, leaving the 
subject with a repertoire of habits or 
strategies for solving RI problems with 
equal facility regardless of the rule. 
This repertoire may, in fact, be analo- 
gous to the knowledge of stimulus at- 
tributes which, it is assumed, the 
subject brings with him to most experi- 
mental situations. 


Rule and. Attribute Identification 


Rule-identification tasks are analo- 
gous to those requiring identification 
of relevant attributes. In both, one 
aspect of the problem is unknown, 
either the rule (RI) or the relevant 
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attributes (AT). In both, given a com- 
plete understanding of the possible 
values of the unknown and a systematic 
plan or strategy for isolating the cor- 
rect value(s), the subject may solve 
the problem in a computable minimal 
number of stimulus instances which de- 
pends almost exclusively on the num- 
ber of possible values of the unknown. 

The dichotomy implied by the terms 
RL and RI is debatable. The problem 
is that acquisition is in this case essen- 
tially a process of gradual change in 
performance with practice, Tt takes 
some arbitrary criterion to distinguish 
the attainment of habits from their 
utilization, One cannot appeal to su- 
perior performance during any so-called 
identification problem as evidence of a 
difference between learning and utili- 
zation of conceptual rules. Such su- 
periority, which indeed exists, can be 
construed as reflecting in the identi- 
fication phase merely a continuation of 
the improvement shown in the RL 
problems. The differentiation between 
RL and RI rests not on performance 
but in the nature of the experimental 
task. As the subject learns new rules, 
in fact, as soon as he has learned two, 
it is possible to present him with a 
problem requiring the identification of 
Which of the known rules is correct, 
If the subject has not completely mas- 
tered either or both rules, some RI, 
may occur. The degree to Which the 
subject actually has mastered the set 
of rules is the degree to which any 
problem involves true RI. 

It is difficult to Specify the degree 
to which improvement in performance 
over a series of AI or RI problems 
results from increasing familiarity with 
the rule(s), or from the development 
and utilization of Strategies appropriate 
to the identification of unknowns, 
Rules differ in familiarity to the un- 
sophisticated subject (Wells, 1963), 
and it is virtually impossible to equate 
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them on this factor through preliminary 
instructions, no matter how detailed. 
Thus, “true” АТ or RI conditions 
would necessitate thorough  under- 
standing of the relevant rule(s), such 
as might be provided by the mind 
ditions of the present experiments, 
prior to the presentation of Чеш 
tion problems. The effects of such 
prior training should be explored more 
thoroughly in subsequent experiments. 


Theoretical Comment 


Emphasis in this paper has been 
placed primarily on an analysis of con- 
ceptual problems ; the result is as muc 
à model of the task as it is a model o 
the organism. Even if the assumption 
is true that some subjects operate 0 
à problem with an intuitive truth table, 
it holds only after considerable gpa 
ence solving problems based on a ond 
ety of rules, Moreover, this assump 
tion in no way generates a description 
of the manner in which initial rules ex 
acquired, nor does it specify how чй- 
known attributes are identified. igs 
adequate theory of conceptual belii 
must embody a set of principles шет 
Provides ап empirically valid accou” 
of these Processes, e 

he model of efficient performance 
in RL and RI tasks presented m 
Presumes that the subject achieve á = 
encoding of known stimulus attributes 
into an effective truth table ; the unique 
distribution of the attribute continge? 
cies of the truth table into categor 
then identifies the relevant rule. Ме 
theoretical interpretations of concepit ; 
behavior (e.g., Bourne & Restle, 19 Е 
Restle, 1962) have been concerned PF 
marily with a different case he 
wherein it can be assumed that t?" 
subject knows the rule and must гай 
tify the relevant attributes only. a 
lytic separation of attribute- and aca 
learning aspects of conceptual behavi? 
does not preclude the development 
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а theory capable of describing per- 
formance when both rule and attributes 
are unknown (CL). Yet, available 
evidence is clearly too fragmentary to 
sustain any set of strong theoretical 
assertions. 

. It may be noted that the RL model 
implies a hypothesis-testing strategy 
Which yields a simple test among al- 
ternative possible solutions. Each new 
contingency reduces the number of pos- 
sibilities until only the correct one re- 
mains. Restle (1962) and others have 
conceived of AI as proceeding along 
similar lines. "That is, the subject se- 
lects, tests, and rejects stimulus attri- 
butes (as hypotheses) until the correct 
One or combination is discovered. In 
view of the considerable success of these 
models in providing detailed, quantita- 
tive accounts of performance in simple 
Concept-identification tasks, it is rea- 
Sonable to suggest an extension to more 
complicated CL problems. 

If both the rule and the relevant 
attributes are unknown, the subject 
may (a) randomly select a pair of at- 
tributes on a provisional basis for en- 
Coding in a truth table, (b) test for 
Systematic assignment of the attribute 
Contingencies to response categories, 
and (c) reject the pair and select a 
new one if tests for consistent assign- 
ments fail, This extension implies a 
Ormal model with two independent 
ее, corresponding to discovery 
dc two unknowns. The processes 
а Bop overlap, , however, in the 
^ie that the subject must test for 
ie ematic contingency assignments to 
c) Ponse categories subsequent to the 

ection of any attribute pair. 
ed is clear that any such theory has 
the *a no validity as an account of 
"us avior of naive subjects in CL 
facts роге data show that the sub- 
Bites egin to encode relevant attri- 
a only after an appreciable amount 

Practice on experimental problems, 
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and even then only when the relevant 
attributes are given at the outset. 
Thus, the accuracy of this description 
is very likely limited to those cases 
wherein the subject has attained mas- 
tery of several rules. Observations of 
the naive subject in the present experi- 
ment are closer to expectations based 
on Hunts (1962) interpretation. 
Most subjects presume the problem 
to be conjunctive at the outset, with 
the obvious and predictable result that 
CL and AI conjunctive problems do 
not differ in difficulty. When the con- 
junctive strategy is insufficient, the sub- 
ject discovers a set of partial solutions 
which, taken together, permit him to 
attain criterion. Thus, Hunt’s model 
may provide an adequate description of 
the subject's performance on initial CL 
problems. While solution to such 
problems may be attained through the 
acquisition of a decision tree, as Hunt 
suggests, present data indicate that 
certain generic decisions in the tree— 
corresponding to the relevant rule— 
are transferable across problems, for 
evidence of learning to learn (or solve) 
is provided by both experiments. 
Since both inter- and intrarule transfer 
were observed, it seems likely that gen- 
eric decisions somehow contribute to 
the formation of an intuitive truth ta- 
ble. The nature of this acquisition 
process is, however, obscure. 


Suggestions and Conclusions 


The analyses and results presented 
here immediately suggest certain prob- 
lem areas requiring further study. 
First, and perhaps most obvious, 15 the 
exploration of parameters affecting RL 
and RI. Many of the variables which 
exert a pronounced effect in AI, such 
as number of irrelevant dimensions, 
may be expected to have little or no 
effect in RL. On the other hand, 
possibilities which are irrelevant to Al, 
such as the number of known rules, 
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may have direct and measurable effects 
in RI. : 

Second, there is a need for more rig- 
orous study of AI under rules other 
than the conjunction. Consideration of 
previous results and the present analy- 
sis suggests that a large part of the 
differences across rules in previous ex- 
periments may be attributed to differ- 
ential prior training and familiariza- 
tion. Further study in which subjects 
are thoroughly trained on the relevant 
rule, so that it constitutes a true 
"given" in the problem, should clarify 
the question of inherent rule difficulty 
in АІ. It may be anticipated that 
somewhat different strategies will be 
utilized when different rules are in 
effect, since it is apparent that those 
appropriate to conjunctive problems 
are not efficient for other rules (Bruner 
et al, 1956). Whether important de- 
terminers of conjunctive concept iden- 
tification will have a different effect 
when other rules are used, however, is 
not clear. The results of Kepros and 
Bourne (1963) with  biconditional 
problems suggest no difference in at 
least one case, that of increasing the 
number of irrelevant dimensions. But 
additional experiments to determine, 
for example, whether increasing the 
amount of intradimensional variability 
has a similar effect on AT with a vari- 
ety of rules are needed. 

Third, although previous work has 
shown that the proportion of positive 
instances has a significant effect on 
solving conjunctive problems (Hovland 
& Weiss, 1953), it is not safe to gen- 
eralize this result to all concept types 
and problems, for it may be no more 
than an artifact of the appropriateness 
of a positive focus for conjunctive stra- 
tegies (Hunt, 1962). Rules which are 
more easily solved with a negative 
focus may well show a significant su- 
periority for negative instances. Apart 
from the question of strategies, the 
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present analysis suggests that the pro- 
portion of the four stimulus «ae 
gencies may be an even more powerfu 
determiner of solution in certain prob- 
lems than the raw proportion of posi- 
tive and negative instances. i 

Fourth, the present experiments dea | 
only with rules based on the sententia 
calculus, i.e., rules reducible directly to 
expressions involving only negation, 
conjunction, and inclusive Чаўиленоп- 
Other classes of rules, expressing a 
relationships as equality, order, inc ue 
sion, and so forth, remain to be ne 
plored. It is of interest to observe es 
the subject determines the relevant ei 
lationship between given or known 2 " 
tributes, and to compare such perform 
ance with that on the logical rules us¢ 
in the present study. а 

These and other problems remain B 
be investigated. Studies reported e 
though only a beginning, attest tO т 
advantages of separating importa Fa 
structural and behavioral componen е 
in conceptual problems. They maien 
the obvious perhaps—that performan 
in these tasks is a complex aaepe Р 
thought and behavior, involving st " 
mediating mechanisms which interce Е 
betweén problem and solution. In sat 
dition, they provide some promis! E 
techniques for elucidating vm 
which may have been confounded Ed 
task requirements in previous resear - 
There is much more to be eani 
plished, however, before a grt 
and complete understanding of сп 
components, and the variables W^ 
govern them, is at hand. 
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METHOD FOR CHANGING STEREOTYPED 
RESPONSE PATTERNS BY THE 
INHIBITION OF CERTAIN 
POSTURAL SETS? 


FRANK PIERCE JONES 


Institute for Psychological Research, Tufts University 


An empirical method is described for changing habitual response 
patterns by inhibiting postural sets which disturb the reflex balance of 
the head. The procedure results in a redistribution of postural tonus 
which is reported by S as a decrease in the feeling of weight and 


in the effort needed to move, 
recorded by multiple-image p 
electromyography. Anatomical 
suggested to explain the phenome 
science are discussed. 


The problem of behavioral change 
becomes more important as society be- 
comes more complex. Even if a com- 
plete set of desirable responses could 
be successfully taught to everyone, the 
necessity would still remain, in a 
rapidly changing world, of modifying 
or eliminating some of them. This is 
easier said than done, however, as any- 
one can testify who has attempted to 
change an unwanted habit. A. well- 
learned response pattern continues to 
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Differences in posture and movement are 
hotography, X-ray photography, and 
and physiological | 
non. The implications for behavioral 


mechanisms are 


be elicited by appropriate stimuli ins 
after it has lost all value for the pipe" 
ism. “I see the better course $ 
approve of it,” the poet says, “ÞU 
follow the worse.” :rical 
This paper will describe an mmi 
method for changing habitual айа 
of behavior by inhibiting certain Рон 
tural “sets.” Once understood, tern 
method can be applied to any pat ost 
of response. It is demonstrated 2 es 
easily with a movement which invo i" 
head, neck, and trunk and which 
performed against gravity. 


PosrumarL SET 


a 

The term set is used here to nr dis 
preliminary change in the level an 0 
tribution of tension as a preparation a- 
movement (Jones, 1963). In s 
Tation for a movement against Fiten 
ity, the increase in tension 18 ber 
Sufficient to change the relation e is 
tween head and trunk, This — D 
$0 much a part of the movemen the 
come that it is seldom detected БУ 1 
mover. If it is eliminated йош 
response pattern (i.e., inhibited е 
movement itself will differ mar* 


the 
dly 
e 
. n 
from the ordinary movement 1 
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way the weight of the body is per- 
ceived. In addition, the movement will 
produce a different distribution of pos- 
tural tonus, against which a subsequent 
set can be perceived kinesthetically as 
a figure-ground relation. The charac- 
ter and significance of the phenomenon 
can perhaps be best understood from 
an introspective account. 


INTROSPECTIVE REPORT 


The effect on a movement pattern of 
changing the habitual relation hetween 
head and trunk was first demonstrated 
to me by the late A. К. Alexander.” 
Alexander used the movement from 
Sitting to standing for the demonstra- 
tion. The experience, which is still 
Vivid in retrospect, was unexpected. 
Assuming that the demonstration 
would have something to do with 
voice production, I did not anticipate 
а movement of any kind. At the start, 
Alexander made a few slight changes 
in the way I was sitting. These 
changes seemed quite arbitrary to me, 
and T could not remember afterwards 
what they were. Then, asking me to 
leave my head as it was, he initiated 
the upward movement without further 
Instructions. The movement (from 
ram to standing) was completed be- 
ore I had a chance to organize any 
voluntary response. The sensation 
ica not that of either getting up or 
eing lifted. My body seemed to be 


he A. R. Alexander was a brother of F. 
d Alexander (1923, 1932, 1941). Alex- 
Н ers teaching, which was based on a 
ене of conscious inhibition, had ап 
portant influence on the thinking of 
19507 (1922, 1923, 1932; McCormack, 
los" Coghill (1941) and Dart (1947, 
di gave firsthand accounts of Alexan- 
abl 8 system. It was noted briefly but favor- 
(rad Sherrington (1946) and Herrick 
the 9) and discussed at various times in 
19541 ritish medical press (Barlow, 1945, 
t Macdonald, 1926; Rugg-Gunn, 1940). 
tee posing account has recently been writ- 
y Carrington (1963). 
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straightening reflexly against gravity. 
The analogy of the knee jerk sug- 
gested itself immediately. The two 
movements were similar kinesthetically, 
but this one, in an integrated fashion, 
involved the whole extensor system. 

In addition to the reflex effect, the 
movement was notable for the way 
time and space were perceived. 
Though less time than usual was taken 
to complete the movement, the rate at 
which the head and other parts moved 
was paradoxically slower, and the 
trajectories which they followed were 
unfamiliar. The experience is difficult 
to describe, but the impression was 
that of a sudden expansion in both di- 
mensions so that more time and space 
seemed available for the movement. 

The sense of reduced weight which 
accompanied the movement persisted 
after it was completed. The centers of 
gravity of the head and trunk seemed 
to have shifted forward in relation to 
the feet, and the weight of the body 
to be supported by structures which 
previously had not been called into 
play. The effect of the change was to 
eliminate much of the fatigue which 
for me had been associated with stand- 
ing. 

What later became a clear-cut, easily 
recognizable kinesthetic experience of 
lightness and ease of movement regis- 
tered at first as the absence of sensa- 
tions which, though familiar, had never 
been consciously observed. The nature 
of these sensations began to be mani- 
fest, however, when an attempt was 
made to repeat the movement from sit- 
ting to standing. This time І got set 
in anticipation. There was an increase 
of tension in the neck, trunk, and limbs 
as soon as the suggestion of getting up 
was made. The tension had apparently 
always been present as a preliminary 
to movement, but by some process of 
adaptation it had passed unobserved. 
Now, against the remembered back- 
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ground of the previous movement, it 
stood out as a recognizable pattern. 
It was as if the reflex movement had 
eliminated some of the “noise” from 
the system, so that when the signal re- 
appeared it could be perceived as a 
discrete pattern of spreading tension. 
The pattern began first in the neck, 
fixing the position of the head and 
spreading quickly to the trunk and 
limbs. While it was present, no move- 
ment except the habitual seemed pos- 
sible, 

Once this signal—this pattern of 
spreading tension—had been clearly 
perceived, I found it reappearing in a 
great variety of situations—in speak- 
ing, for example, in taking a deep 
breath, in climbing stairs, in lifting a 
heavy weight, in changing the fixation 
of the eyes. The pattern was magni- 
fied by stress, but seemed to be present 
in some degree a great deal of the time. 
It was repeatedly demonstrated to me 
that when this pattern was inhibited 
and the tension prevented from spread- 
ing, the character of the associated ac- 
tivity changed: it became markedly 
easier and was accompanied by the 
same kinesthetic effect of lightness I 
had observed before, 

The kinesthetic effect of lightness 
was tonic and persisted long after a 
particular. movement had been com- 
pleted. With it almost from the start 
came certain automatic or semiauto- 
matic changes which are probably sig- 
nificant for understanding the phe- 
nomenon: (a) a change in the rest 
position of the eyes and a marked re- 
duction in the tension used in eye 
movements; (b) a change in the rest 
position of the jaw and a relaxation of 
tension in the tongue and throat; (c) 
a change in the rate and depth of 
breathing, associated with an increase 
in the excursion of the diaphragm. All 
of these changes were corollaries of 
the postural change and disappeared 
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when I returned to my habitual pattern 
of posture and movement. 

I have described a clear-cut sensory 
experience. Though kinesthetic in ori- 
gin, it had the sharp and ав 
reality of experience obtained through 
other sense modalities. It was 
peatable, and it was not accompanie 
by any detectable form of suggestion. 
I was immediately struck by the йр 
cations of the experience for behaviora 
science. It seemed to open up to p 
trolled investigation an area of the = 
which has heretofore been notoriously 
private and inaccessible and to provide 
the individual with a reliable means -— 
changing a behavior pattern ie 
delving into the past or having ^ 
course to some outside authority. т 
inhibitory control which I found mys 
using in simple, everyday movemen 
seemed capable of extending to y 
situation which could be expressed с 
terms of stimulus and response. ча 
fore speculating, however, about 17 
plications and extensions, I wp 
that experimental data were needed а 
order to define the phenomenon a 
tionaly. To this end, a series fts 
Studies have been made at the Tu h 
Institute for Psychological не 
I shall review them here and advance 


of 
theory of mechanism to account f 
the data. 


Procepures ror CHANGING THE 
Б] 
BALANCE or tHe HEAD 


The subject, seated їп а comfortable 
“relaxed” posture, is asked to straigh e 
Up into a more erect posture. As on 
Tesponds, a slight change can ре 7 
to take place in the axis of his he? 
which is usually rotated backwa a 
bringing the occiput closer tO is 
seventh cervical vertebra. If ne ег 
asked to straighten up even furt 


are 
*The procedures described hem M. 
adapted from procedures used by F 
Alexander, 
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Fic. 1. Three sitting postures, traced from 
photographs. (From left to right: habitual 
relaxed, habitual erect, and experimental. In 
the final picture the subject has been guided 
into the erect posture by the experimenter, 
who counterbalanced the backward pull on 
the subject’s head during the movement.) 


into his “greatest sitting height,” the 
angle of head rotation will increase 
(Jones, Gray, Hanson, & Shoop, 
1961), After the subject has returned 
to his relaxed posture, the experi- 
menter applies a light pressure at the 
back of his head just sufficient to coun- 
terbalance the backward rotation. The 
subject is then guided in a series of 
small movements, while the experi- 
menter continues to prevent the back- 
Ward rotation of his head, taking care 
not to evoke stretch reflexes in neck 
muscles by applying unnecessary pres- 
sure, At the end of the procedure, the 
subject will again be in a more erect 
Posture, but the relation of head to 
trunk will be different from that ob- 
served in the habitual posture (see 
Figure Ly. 

If these procedures are carefully fol- 
lowed, the subject’s weight will seem 
to the experimenter to be progressively 
reduced until little effort is needed to 
move him passively. Any movement 
carried out after the change will follow 
a different course from the correspond- 
ing habitual movement. 


ЕкЕЕСТ or Hran BALANCE ON 
MOVEMENT PATTERN 
The effect of a change in head bal- 
ance on movement pattern was first 
demonstrated by Jones and Narva 
(1955) with multiple-image photog- 
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raphy. Small electric lights were at- 
tached to the subject’s head, trunk, and 
limbs and connected with wires to a 
battery. The subject moved with pro- 
file to the camera, which was left open 
throughout the movement while an 
episcotister (or “Marey wheel”) in- 
terrupted the moving image 10 times a 
second. The images of the inter- 
rupted lights provided a time-space 
pattern of the movement. Though the 
technique lacked precision, the photo- 
graphs showed convincingly that the 
pattern of a movement is altered when 
the head balance is changed. 

Jones and O’Connell (1956) refined 
the method by attaching strips of 
Scotchlite reflecting tape to the subject 
and recording the moving image by 
repetitive strobe at rates of 5, 10, and 
20 flashes per second. The strobo- 
scopic method is highly flexible and 
allows the subject a maximum of free- 
dom to move. Patterns obtained in 
this way are sharp and clear-cut and 
contain a vast amount of information 
that can be quantified. An example 
of the method is shown in Figure 2. 
The subject wears a black jacket to 
cut down reflectance. A small cross, 
which is centered in the Frankfort 
plane (Howells, 1937) halfway be- 
tween the tragion of the ear and the 
lowest point of the orbit, marks the 
position of the subject's head and its 
angle of rotation. Other markers are 
attached over the seventh cervical 
vertebra and the sternal notch and to 
the upper and lower arms. In the 
picture on the left, the subject sits lean- 
ing forward in the chair and moves 
back into an upright sitting posture. 
The strobe is pulsed at 5 flashes per 
second. In the picture on the right, 
the movement is repeated after the ex- 
perimenter has changed the relation 
between the subject's head and trunk. 
In the experimental movement, the 
trajectory of the head is higher, the 


2a. Habitual. 


(Reflecting markers are placed on: 
sternal notch; upper and lower arm, 


curve is smoother and more regular, 
and the change in head pattern is ac- 


companied by changes in the patterns 
of the trunk and arm. 


Other everyday movements which 
have been recorded in this way are 
walking, stair-climbing, stooping down 
to pick up an object from the floor, 
straightening up from a slump, and the 
movements from sitting to standing, 
from standing to sitting, and from lying 
down to sitting up. In each of these 
movements, the pattern is characteris- 
tically altered when the relation. of 
head to trunk has been changed. 


MovEMENT FROM SITTING 
TO STANDING 
The most striking differences in pat- 
tern are seen in the movement from 


head; neck, at seventh cervical vertebra; chest, 
Strobe at 5 flashes per second.) 
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2b. Guided. 
А : : Е © rect. 
Fic. 2. Stroboscopic multiple-image photographs, from leaning forward to sitting еї 


at 


sitting to standing (see Figure 3). E: 
the picture on the left, the subject, Wi 
markers on head, neck, upper y 
lower arm, leg, and foot moves ed 
sitting to standing in his habitual Шия 
In the picture on the right, the joe 
ment is guided by the experimentet 
who has changed the relation B 
head and trunk in the manner descri 
above, z to 
In the movement from sitting di- 
standing, habitual patterns show Qn 
vidual differences which remain ial 
markably constant from trial to cd 
(Jones & Hanson, 1961, 1962). bes 
the movement is guided, individual the 
ferences tend to disappear so that ini- 
guided patterns are much more tive 
form than the habitual. Quantitat 8 
methods of pattern analysis (Jone 
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O’Connell, 1958) were used to com- 
pare guided with habitual movements. 
In an analysis of 36 patterns from six 
subjects, indexes were found that dis- 
tinguished the guided from the habitual 
at a confidence level of .01 or better 
(Jones, Gray, Hanson, & O'Connell, 
1959). The three indexes which have 
proved most useful in subsequent 
studies are: (a) head thrust, which 
measures the forward thrust of the 
head during movement ; (b) trajectory 
ratio, which measures the extent to 
which the head trajectory departs 
from a straight line: and (c) rise 
time, which measures the time needed 
to bring the head above the starting 
level. In the guided movement, head 
thrust decreased, rise time was shorter, 
and head trajectory approached more 
closely a straight line. 

The value of the three indexes was 
tested by applying them to patterns 
which could be judged “better” or 
“poorer” on the basis of some external 
criterion. Jones and Hanson (1961) 
analyzed the movement from Sitting to 
standing as it was performed by “well- 
coordinated” and "poorly coordinated" 
Subjects. Jones, Hanson, Miller, and 
Bossom ( 1963) Compared the same 
movement in a group of normals and a 
group of patients with neurological dis- 
eases. The three indexes which had 
distinguished guided movements from 
habitual distinguished (again at a con- 
fidence level of 01 or better) well 
coordinated from Poorly coordinated 
and normal from abnormal movements, 
By these criteria, then, the guided 
movements are not only different from 
the habitual, they are better, 


SUBJECTIVE EXPERIENCE 


For the great majority of subjects, 
the guided movements are accompanied 
by a kinesthetic experience of lightness 
and ease of movement, which tends to 
persist after the experimental session 
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is concluded (Jones, 1954). By some, 
this kinesthetic effect is observed im- 
mediately, i.e., after the first guided 
movement. Others observe it only 
after the movements have been re- 
peated a number of times. When 
asked to describe the experience, sub- 
jects are apt to find it difficult to gar 
their feelings into words. To make 
the task easier, a check list of 18 adjec- 
tives was constructed, 16 of them 
paired opposites (Jones, 1964). T * 
subject is first asked if the guide 

movements felt different in any respect 
from his ordinary movements. If v 
answer is “yes,” he is given the list 0 
adjectives and asked to check those 
that best describe the difference. d 

Data from 39 subjects are presente 
in Table 1. All of the subjects were 
normal young adults without previous 
knowledge of the phenomenon ew 
Study. The responses were obtaine 
after a brief demonstration of 10 or 1 
minutes, 

The adjectives most frequently 
checked (ie, lighter, less familiar, 
higher, and smoother) seem to give the 
Core of the sensory experience, Other 
adjectives reflect individual differences 
in the attempt to describe the mi 
miliar. A feeling of “unreality” ane 
Some subjects observed was variously 
identified by “brighter,” “duller,” or 
"softer." The fact that some subjects 
checked “slower,” while others checke 
"faster," seems related to the para 
doxical character of some of the guide 
movements (eg. sitting to standing, 
Which may be completed in a shorte 
time than the corresponding [дле 
movements, but which reach a lowe 
maximum velocity. Sometimes the а 
perimenter failed to convey the Kies 
thetic effect of lightness. When this 
was the case, a different constellaties 
of adjectives was checked wit 
"heavier" and "more difficult" eres 
ing "lighter" and “easier.” Some su 
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TABLE 1 
PERCENTAGE OF RESPONSES OF 39 SUBJECTS TO THE ApyecrivE CHECK List 

Adjective % Adjective % 
Lighter 72 Телѕег 20 
Less familiar 62 Brighter 15 
Higher 59 More difficult 15 
Smoother 54 Less steady 13 
Slower + Heavier 13 
More relaxed H Faster 10 
Easier ES! Jerkier 10 
Softer 38 Duller 8 
Steadier 36 Lower 3 


Jects reported that the feeling of light- 
ness was confined to the upper part of 
the body at the start and did not extend 
below the waist until a later stage in 
the demonstration, or until a later 
Session. The spread of the kinesthetic 
effect from the head downwards (it is 
never reported the other way around) 
15 undoubtedly important for under- 
Standing the phenomenon. 

To measure the subjective feeling of 
reduced weight, the method of magni- 
tude estimation has been used. The 
Subject is asked to put a value of 10 
Оп the effort he ordinarily exerts in a 
Movement against gravity ; then, on the 
Same basis, he is asked to estimate the 
pod involved in the experimental 
: Сп, Measured in this way, the 
eling of weight may be reduced any- 
where from 20 to 80%, with some sub- 
d reporting that all sense of effort 
as disappeared. 


І 
X-ray Srupy оғ тнк Heap-Neck 
RELATION 


may photography was used by 
weis and Gilley (1960) to obtain a 
bali. definition of the three postures— 
ex ual relaxed, habitual erect, and 
*perimental—shown in Figure 1. 
=A Subjects were 20 students at the 
з ‚ University School of Dental 
sta aoe who were taking part in a 
Y of dental occlusion. X-ray pho- 


tographs were taken of each of them in 
the three postures. To analyze the 
photographs, planes were constructed 
through the head and neck, and the 
angles which these planes made with 
the horizontal and with each other 
were measured. Additional measures 
were taken of the distance between the 
spines of the first two vertebrae and 
of the distance between sella turcica 
(which corresponds roughly to the 
center of gravity of the head) and a 
horizontal line drawn through the sec- 
ond vertebra. When the three pos- 
tures were compared, it was found that 
both of the erect postures differed 
from the habitual relaxed in the angle 
between horizon and neck. They dif- 
fered from each other in the angle be- 
tween horizon and head, and in the 
angle between head and neck. In the 
experimental posture, the distance be- 
tween the first two vertebrae was 
greater, and the distance between sella 
turcica and the second vertebra was 
smaller, than in the habitual erect. 
"These differences are significant at the 
:01 level or better. 

X-ray photographs of the two erect 
postures are shown in Figure 4. In 
the experimental posture (on the 
right) the neck has lengthened, the 
head has rotated forward on the atlas, 
the distances between the spines of the 
vertebrae and between the vertebrae 
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and the occiput have increased. Though 
the head is slightly higher in the ex- 
perimental posture than in the habitual, 
its center of gravity relative to the 
vertebrae is lower. 


ROLE oF THE STERNOMASTOID 
MuscrE 


In the light of the X-ray findings, 
it seemed reasonable to expect a dif- 
ference in the activity of neck muscles 
in the two erect postures. Preliminary 
investigation pointed to the sterno- 
mastoids (the prominent diagonal 
muscles on either side of the neck) as 
the muscles which distinguished most 
sharply between the two postures. 
Later, the postural activity of these 
muscles was systematically studied in 
seven male subjects between the ages 
of 16 and 21 (Jones, Gray, Hanson, & 
Shoop, 1961; Jones, Hanson, & Gray, 
1961). Surface electrodes were placed 


over the right sternal head halfway 


between origin and insertion. The ac- 


Fic. 4. 


| 


Х-гау photographs of two erect sitting 
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tivity of the muscle was recorded as 
the subject sat in his "most comfort- 
able" posture, in his "best" posture, 
and at his “greatest height." It Wo 
recorded again when he was guided 
into the "experimental" posture. Mean 
potential for the seven subjects in- 
creased from 7 microvolts for most 
comfortable to 10 microvolts for best 
and to 26 microvolts for greatest sit- 
ting height. In the experimental pos 
ture it dropped back to 6 microvolts. 
The difference in the behavior of the 
sternomastoid appeared not only !? 
the postures themselves but also in the 
movements which led into them. 
differences (which are illustrated x 
bar graphs in Figure 5) are significan 
at a confidence level of .05 or better. 


ANATOMICAL MECHANISMS 


The head can be rotated, tilted, i 
lowered by contracting the muscles 2 
tached to it; but it cannot be lifted. 
can be lifted only by straightening ап 


sental) 
postures. (A, habitual; B, experiment 
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MICROVOLT 
N 


BSP GSH 
POSTURE 


MCP 


Fig. 5. 


Postural activity in right sternomastoid muscle. 


Exe H G 


MOVEMENT 


(Means for seven subjects. 


Abbreviati 
В ops are: MCP, most comfortable sitting posture; BSP, best sitting posture; GSH, 
sitting height; Exp, experimental; H, habitual movement into erect sitting posture; 


G i à 
3, guided movement into erect sitting posture.) 


ааа the cervical spine. Ве- 
-s vs the vertebrae and the skin there 
ined Structures whose joint action 
m ie the height of the head and 
а а ng e of tilt and rotation. They are 
own schematically in Figure 6. 


Intervertebral Discs 


"ess Pa en discs between the 
and аге the vertebrae contain fluid 
aes pp of exerting hydraulic 
len v bony structures around 
i ipeo 1942, p. 281). The discs 
ку к, under pressure by ligaments 
sal y MA muscles, including the 
нее в чаан which run from опе 
Shoren a the next. If these muscles 
tibi] ^ 2 distance between the ver- 
ise b 5а. will be lessened, and the 
versely. ч further compressed. Соп- 
will HA they lengthen, the distance 
Бете азе аза result of the released 
vertebral E: the discs. In the inter- 
Уй, Soi е, is a mechanism 
Wires a the height of the head can be 

a small but significant amount. 


Flexor. 
7075 and Extensors of the Neck 


Bot > 
Te convexity (extension) of the 
tion (A curve and its forward inclina- 
exion) are countered by the 


tension of ligaments and muscles whose 
origins and insertions are on the verte- 
brae themselves (Gray, 1942, pp. 390, 
394-395). Acting together, the flex- 
ors and extensors of the neck straighten 
and strengthen the cervical spine, turn- 
ing it into a column of support and, in 
so doing, increase the height of the 


head.* 


4In standard textbooks of anatomy, the 
extensors of the head and the extensors of 
the neck are treated under a single heading 
and illustrated with the same plates so that 
it is sometimes difficult to distinguish one 
group of muscles from the other. They have 
similar names (splenius, longissimus, etc.) 
and similar origins on the spinal column. 
They differ, however, in their insertions and 
in the functions which they períorm. When 
the neck is flexed, the head is brought for- 
ward and down. If, then, the neck extensors 
contract, the spine will be straightened (ex- 
tended), and in the process the head will be 
brought to a higher level, even though the 
head extensors remain relaxed. Contracting 
the head extensors, оп the other hand, will 
not lift the head, but will only tilt it back- 
ward so that the occiput approaches the 
seventh cervical vertebra. Duchenne (1959, 
p. 534) first pointed out in 1867 the differ- 
ence of function between the two sets of 
extensor muscles. He had observed that 
patients whose neck extensors were paralyzed 
could not lift their heads, though they still 
had the full use of their head extensors. 
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Fic. 6. Anatomical structures affecting the height and angle of the head. 
vertebral disks with interspinales and intertransz 


of the neck. C, flexors and extensors of the head. D upper trc 


mastoid muscles.) 


Flexors and Extensors of the Head: I 


In the upright posture, the head is 
in unstable equilibrium, its center of 
gravity forward of the base of support 
(the atlas), and its weight balanced by 
the tension of ligaments and muscles 
which run from insertion points at the 


D 
(A, intet- 
5 А, ш 
versartt muscles. В, flexors and exten d0- 
apeziws and sterno" l 
d joins 
base of the skull to various orig : 
mtt 


along the spinal column. These " ge- 
cles, the extensors of the head, ате he 
signed to balance, in addition tee a 
weight of the head, the tension i 
smaller group of flexor musel pt 
front. By the lengthening and * 
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ening of the various muscles in these 
two groups the head can be moved in 
many directions without destroying the 
equilibrium of forces (Gray, 1942, pp. 
386-395). 


Flexors and Extensors of the Head: 1T. 
Sternomastoid and Upper Trapezius 


There are two pairs of flexors and 
extensors which connect the head di- 
rectly with the shoulder girdle. They 
are the sternocleidomastoids (Gray, 
1942, p. 384) and the upper trapesii, 
which form a blunted isosceles triangle 
at the back of the neck (Gray, 1942, p. 
428). Simultaneous contraction of the 
two pairs of muscles will bring the head 
closer to the shoulder girdle and in- 
crease the curve of the cervical spine, 
the sternomastoid drawing the head for- 
ward and down, the upper trapezius 
Preventing forward rotation by retract- 
ing the occiput.’ In the process, the 
origins and insertions of the muscles 
and ligaments which maintain the 
"wp ds of the head against gravity will 
зе brought closer together. 


N 


STARTLE PATT 


An example of the joint action of 
ы ешын. and upper trapezius 
Fio be seen in the “startle pattern" of 
е 7. Surface electrodes have 
Wis Pec to the subject's skin 
ber үч right sternomastoid and right 
апан тарози. In Figure 7A he is 
inae ^g in his "most comfortable pos- 
Startled In Figure 7B he has been 
dbor ed by the sudden slamming of a 

- The two sets of muscles have 
The head 
ST 


hun t be significant that the two muscles 
he same phylogenetic origin (Romer, 
de 285) and that, unlike other neck 
on ir they receive their principal inner- 
еһе а cranial nerve (the accessory). 
xtreme е (1959, р. 5) commented on their 
Which h Sensitivity to electrical stimulation, 
ter or qi. Mtributed to the peculiar charac- 
Meir nerve supply. 


Сот А 
itracted simultaneously. 


mus, 
vati 


is thrust forward, but the Frankfort 
plane remains horizontal The pos- 
tural change does not stop with the 
head and neck. As in one of the "atti- 
tudinal reflexes" described below, the 
shoulders are lifted, the chest is flat- 
tened, the legs are flexed, and the arms 
are extended. 

The startle pattern, which was 
studied with high-speed photography 
by Landis and Hunt (1939), provides 
a vivid example of how “good” posture 
can change to “bad” in a very brief 
time. The pattern itself is typical of 
bad posture in general, whether it is 
the result of age, disease, or lack of 
exercise (Jones, Hanson, & Gray, 
1964). In the startle pattern, the ac- 
tive character of malposture and the 
sequence of events by which it comes 
about can be clearly observed. The 
response is not instantaneous. It be- 
gins in the head and neck, passing 
down the trunk and legs to be com- 
pleted in about 4 second. The neck 
muscles are central in the organi- 
of the response. Jones and 
Kennedy (1951), who studied the 
startle pattern by multiple-channel 
electromyography, placed surface elec- 
trodes in various locations on the sub- 
ject’s neck, trunk, and limbs, with one 
pair always over the upper trapezius. 
The intensity of the stimulus was 
varied from the sound of a dropped 
book to the sound of a .32 caliber re- 
volver. Sixty patterns were obtained 
from eight subjects. In all cases when 
the stimulus was strong enough to 
elicit a response, it appeared in the 
neck muscles; in many cases, the re- 
sponse appeared nowhere else. 


zation 


PHYSIOLOGICAL MECHANISMS 


I have described two reciprocal sets 
of structures in the neck, one de- 
signed to straighten the cervical spine 
and move the head out from the trunk, 
the other designed to bring head and 
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Sus. бн SLE : . “most 
Fic. 7. Activity of neck muscles in the startle pattern. (A, subject standing 11 шо 


‚ н n 
comfortable posture." B, subject after door slam. EMG records: above—upper trapes" 
below—sternomastoid. ) 
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trunk closer together and increase the 
spinal curvature. In the procedure 
described earlier in this paper, the 
downward pull of the sternomastoid 
and upper trapezius was counterbal- 
anced by the experimenter while the 
subject moved. It is conjectured that 
in the process, the pressure on the discs 
was reduced, the flexors and extensors 
of the neck shortened, and a lengthen- 
ing took place in the flexors and ex- 
tensors of the head and in the small 
muscles between the vertebrae. Dur- 
ing movement, the lengthening which 
appeared in the neck could be ex- 
pected by purely mechanical means to 
extend along the rest of the spine, with 
the force being transmitted by muscle 
and ligament from one vertebra to the 
next, 


Stretch Reflexes 


The mechanical effect of stretch on 
muscle is modified by the action of the 
nervous system. When a muscle is 
Stretched, it contracts reflexly, and 
the strength of the contraction is pro- 
Portional to the stretch. Though 
Stretch reflexes occur at the spinal 
level, they are under the influence of 
higher centers in the nervous system. 
This influence is transmitted. directly 
to the muscles through the system of 
Small motoneurons, the gamma effer- 
ents. As the level of activity in the 
каца efferents rises ог falls, the 
се of a muscle to stretch in- 
Bi or decreases. Whether a con- 

‘ing muscle shortens or lengthens 
ed m lowering a weight), the stretch 
Shen persists as long as the stretch 
of "я us is applied. Though the range 
ba a over which muscles can 
Opina] às considerable, there is an 
partien] resting length” at which a 
tens: ular muscle exerts maximum 

lon, 
s Seems reasonable to suppose that 
effect of the experimental pro- 
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cedures is to reorganize stretch reflexes 
in the neck and back so that more 
muscle fibers take part in the antigrav- 
ity response, and that the muscles in 
contracting remain closer to their opti- 
mal resting length. This effect would 
account for the subjective feeling of 
reduced weight while maintaining a 
position as well as for the apparent lack 
of effort in movements performed 
against gravity. 


Head-Neck Reflexes 


The kinesthetic phenomenon and the 
change in the movement pattern can- 
not be explained entirely by the be- 
havior of individual muscles. Such 
an explanation does not account for the 
major role which the head and neck 
appear to play in facilitating the move- 
ment, nor for the secondary changes, 
e.g., in breathing, reported by subjects. 

In the posture of animals, it is well 
established that the most important 
mechanism of control is the head-neck 
relation. Magnus and his associates 
(Magnus, 1924; Magnus & de Kleijn, 
1912; Rademaker, 1931), in a long 
series of carefully controlled experi- 
ments, showed that the position of an 
animal’s head in space or its position 
relative to the body can affect the 
distribution of tonus in its neck, back, 
and limbs. Magnus summed up the 
principle by saying that in posture and 
movement the head leads and the body 


follows. . 
In classifying his material, Magnus 


used two categories—the “attitudinal 
reflexes” and the “righting reflexes.” 
The attitudinal reflexes with recep- 
tors in the labyrinths and in the joints 
of the neck (McCouch, Deering, & 
Ling, 1951) are used by an animal to 
maintain a position that is assumed for 
some special purpose, like that of a 
cat drinking from a saucer or looking 
up at a piece of meat held high. The 
position taken by the head imposes on 
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the rest of the body an attitude which 
is maintained as long as it is functional. 
These attitudes are quite stereotyped 
and follow regular rules. According to 
Magnus, they are the most enduring 
and untiring of reflexes. 

The righting reflexes take over 
when an animal is ready to return to 
the normal upright posture. The 
fixed position of the head is released; 
the imbalance of parts, registering at a 
postural center in the brain, initiates 
the righting response; as in the atti- 
tudinal reflexes, the head leads and the 
body follows. The mechanism is most 
strikingly demonstrated when a cat is 
held on its back in the air and then 
dropped. The instant it is let go it 
begins to right itself. The head turns 
first. As it turns, the tensions in the 
neck, back, and limbs are progressively 
altered. The body is twisted around, 
and the cat lands on its feet. (Ma- 
rey’s photographs of the phenomenon 
are reproduced by Chatfield, 1957, p. 
205.) 

Secondary effects of the head-neck 
reflexes on respiration, circulation, and 
eye position have also been demon- 
strated. In diving birds and mammals 
there is a highly developed set of re- 
flexes which stops breathing and slows 
down the heart in order to conserve 
oxygen (Irving, 1939; Scholander, 
1963). In ducks, this reflex is strongly 
reinforced by the change in the rela- 
tion of head, neck, and body that takes 
place in diving. Huxley (1913) found 
that without immersing a duck in 
water, breathing could be stopped by 
bringing the head and neck into the 
diving position or by placing the duck 
on its back and dorsiflexing the head. 
Conversely, breathing was at once re- 
stored if the head was brought back to 
the normal posture. A similar postural 
mechanism to stop breathing and slow 
the heart was later demonstrated in 

diving mammals (Koppányi, 1929). 


A. de Kleijn (1920) used monkeys to 
demonstrate the effect of head-neck te- 
flexes on the position of the eye in the 
orbit. He showed that the eyes moved 
down when the head was dorsiflexed 
and up when it was ventriflexed, and 
that the same shift in eye position se 
place when the head remained fixe 
and the trunk was rotated so as to 
bring the back closer to the occiput oT 
the chest closer to the chin. 


Head-Neck Reflexes in Man 


In human beings, the influence of 
head-neck reflexes is masked by par 
terns of voluntary activity. The mec? 
anisms are clearly present, hoe 
They have been frequently ge 
strated in infants (Gesell, 1954; te 
рег, 1963),° young children (Lane нт 
1923; Schaltenbrand, 1925), patient 
with neurological diseases (Simon 
1923; Walshe, 1923), and in norma 
adults (Hellebrandt, Schade, & poca 
1962; Tokizane, 1951; Wells, 194 н 
A large number of drawings and a 
tographs to illustrate the patterns É 
head-neck reflexes as they manifes 
themselves in dancing, sport, pos 
everyday activity were brought 
gether by Fukuda (1957). at- 

Various specific reflexes, both Je- 
titudinal and righting, have been chan 
fined in the literature. Rather p? 
describe them in detail, I should lik 
here to emphasize what each E РУ 
reflexes has in common. In the а to 
tudinal reflexes, the head is drawn ү > 
a fixed position and tonus is ° 
tributed in the trunk and limbs. (he 
the righting reflexes, again under 
influence of the head, the norma ese 
tribution of tonus is restored. Th 


5 

two mechanisms will account ом 

"T 

ê The secondary effect of differences ү the 

trunk-to-head relation on the position 0 by 

eyes, which was demonstrated in s ud jn 
de Kleijn (1920), was demonstra 

human infants by Voss (1927). 


the 
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economically for the phenomena which 
have been described in this paper. The 
sitting or standing posture of the aver- 
age person functions like an “attitude” 
which has been imposed on the body 
by the head. The procedures em- 
ployed in the experimental movements, 
by releasing the head from its habitual 
attitude, facilitate the righting reflexes 
and bring the subject into a different 
Orientation within the gravitational 
field. The changes in breathing, in cir- 
culation, and in the use of the eyes, 
which are sometimes reported, take 
Place automatically by reflex facilita- 
tion when the head moves into its new 
relationship to the trunk. 


PSYCHOLOGICAL CONSIDERATIONS 


I have described a mechanism by 
Which the antigravity response can be 
facilitated. Tt is a mechanism which 
ordinarily operates below the level of 
Consciousness. Magnus was emphatic 
that the righting reflexes are subcorti- 
cal and inaccessible to direct conscious 
Control: An indirect control can be 


7 Magnus (1925): 

It seems to be of the greatest im- 
Portance that the whole central apparatus 
for the righting function (with the only 
exception of the optical righting reflexes) 
is placed subcortically in the brain stem 
ae by this means withdrawn from volun- 
Y action. The cortex cerebri evokes 
enp ordinary life a succession of phasic 
по тешешв, which tend to disturb the 
ee resting posture. The brain-stem 
dist Tes will in the meantime restore the 
the urbance and bring the body back into 
tic; Эта! posture, so that the next сог- 
ton impulse will find the body prepared 
wart again [p. 349]. 

“agnus (1930): 

a Ve have a subcortically acting 
Pparatus which controls and adjusts the 
Position of our body, whether erect or 
cone ae in relation to space. This un- 
Operi y acting mechanism, by the co- 
биг Saon of complicated reflexes, restores 
it ody to the normal position whenever 

is displaced [p. 103]. 
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established, however, if the subject 
learns to recognize and inhibit maladap- 
tive postural sets which interfere with 
the response of the organism to gravity. 

In the course of motor learning, sets 
may be developed which are not the 
best preparation for the movement to 
come. They may, in fact, hamper the 
execution of the movement. Unfor- 
tunately, like some of the “supersti- 
tious" responses which appear during 
conditioning experiments, such sets do 
not extinguish readily. The organism 
adapts to them quickly; they come to 
"feel right"; and they remain unde- 
tected, because once the stimulus to 
move has been received, attention be- 
comes focused on the goal to be 
reached. 

One of these inappropriate sets is 
the tendency to shorten certain neck 
muscles as a preparation for a move- 
ment against gravity. It has been 
demonstrated that such a movement is 
facilitated when the preparatory short- 
ening of muscles is prevented by the 
experimenter. If the subject becomes 
aware of the tendency, he can learn to 
prevent it and thus establish an in- 
direct control over the postural mech- 
anism. In my experience, the only 
satisfactory way to achieve such a con- 
trol is to reorganize the field of atten- 
tion, so that when a stimulus to move 
is received, the focus of attention re- 
mains within the organism. This does 
not mean that the goal is excluded from 
attention; it means that the goal is 
not allowed to dominate the field. At- 
tention is organized around the head- 
trunk relation, with extension in time 
and space so that both the stimulus and 
the response can be comprised within 
the same field. 

Ordinarily, attention is directed 
either outward to the environment or 
inward to the organism itself. The 
central nervous system, however, is re- 
ceiving information about movements 
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and positions of the body and its parts 
at the same time that it receives infor- 
mation about events in the world out- 
side. There is no reason why the field 
of attention cannot be organized in the 
same way. In such a field, the relation 
of the head, neck, and trunk, kines- 
thetically perceived, forms the back- 
ground against which events outside 
and inside the organism take place. 
Thus it is possible to perceive an ob- 
ject simultaneously with the organism’s 
reaction to it, since both are comprised 
within the same field. 

Perceptually, objects are known to 
vary with the psychological context in 
which they are perceived. A staircase, 
for example, is perceived differently 
depending on whether it is to be 
climbed or merely to be looked at. 
The difference, of course, lies within 
the perceiving organism. If it is a 
staircase to be climbed, it may elicit a 
postural set which is so marked that 
it can be detected by an outside ob- 
server. The set can be detected by 
the climber himself only if he reor- 
ganizes his field of attention so as to 
take in both the staircase and his reac- 
tion to it as he approaches and climbs 
it. With this shift in the focus of at- 
tention, he can perceive the cause-and- 
effect relation between the stimulus 
(the staircase) and his immediate re- 
sponse (the postural set). If he takes 
an experimental approach, he can de- 
vise a means to inhibit the set while 
continuing to make the specific re- 
sponse (climbing the stairs). In the 
process, the antigravity response will 
be facilitated in the same way as in the 
guided movements which were de- 
scribed above. 

Climbing a staircase was used to 
illustrate a principle. Any activity— 
reaching for a pencil or making a 
speech—would have done as well. The 
principle of inhibition can be applied 
to any movement. A movement pat- 
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tern is a complex whole which, for 
convenience, may be thought of as 
having two aspects or parts: a specific, 
goal-directed part, and a tonic or pos- 
tural part, by which the integrity of 
the organism is maintained while the 
specific response is being carried out. 
The tonic or postural part of а movè- 
ment is not ordinarily perceived. Ee 
sional patterns which interfere with 
the smooth performance of movement 
can be perceived, however. If they are 
inhibited, postural tonus is „тше, 
tributed, and the specific. goal-directe, 
response becomes easier. In contras 
to the ease of the facilitated movement, 
old ways of moving come to feel wrong 
and a new sensory standard is 87а 
ually established. в 

In the paradigm of postural shae 
which I have just outlined, inhibition 
is the basic principle. Inhibition к _ 
term which has been used by psycho e 
gists and physiologists in a variety ад 
meanings (Diamond, Balvin, & n 
mond, 1963). I have used it here n 
describe a process by which a p 
consciously refrains from making а p 
sponse which he could make 2 he 
chose. In this sense inhibition is Ё 
central function of 


ions 
a nervous system which, when it functia 
well, is able to exclude maladaptive nc 
without suppressing spontaneity [Diam 
et al., p. 395]. 


In the presence of inhibition, 2 “үт? 
lus should elicit only a generalized ап” 
crease of alertness, leaving the M 
ism free to respond or not respon 
The principle of inhibition, as it 
been developed here, offers a ei 
Proach to the problem of behar e- 
change. In the close connectio? 5 
tween inhibition and postural ton" 
a mechanism which not only reves n 
inner pattern of a stereotyped por 
but brings it under conscious COP pe 
In so doing, it greatly enlarge? 


has 
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area of behavior where free choice can 
operate. 
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COGNITIVE DEPENDENCE ON LINEAR 
AND NONLINEAR CUES! 


KENNETH R. HAMMOND ax» DAVID A. SUMMERS 


Institute of Behavioral Science, University of Colorado 


Analysis of the cognitive process of inductive inference should focus on 
inferences drawn from nonlinear as well as linear relations. Brunswik's 
probabilistic functionalism is demonstrated as a conceptual and method- 
ological framework within which this question can be investigated. 
Analysis of Ss’ utilization of nonlinear relations is illustrated by studying 
30 Ss in the following task: (a) one cue related in a linear, the other in a 


nonlinear manner to a criterion, 


fectly, predictable from either cue alone, 
ate utilization of both. Results indicate that 


performance and nonlinear data utilization, 
d with task-relevant instructions. 


predictable from appropri 
Ss can improve both overall 
and that performance varie 


ana ge is concerned with the 
inductive i the cognitive process of 
the um inference; it is focused upon 
issues tav DIR and conceptual 
the proc olved in the investigation of 
draw hen by which human subjects 
inear ue from linear and non- 
Cussed SONS. The issues are dis- 
i within the context of clinical 


Infer 

ence and ; ity 
m - y 

earning, а ultiple-cue probability 


CLINICAL INFERENCE 


Int á 
Hates] a in the clinical versus sta- 
al prediction issue has shifted 


Some 

У . 

Te ae from a concern with the 
(Meurs Qccuracy of the two methods 


Tee 

€ узы 1954; Sarbin, 1942) to а more 
clinica] analysis of the process of 
1 inference, or to what Meehl 
“The Cognitive 


th ‘arch reported here was under- 
Institute e Behavior Research Laboratory, 
of Color: si Behavioral Science, University 
nstitute 9, and is Publication 51 of the 
pesearc a research was supported by 
үзе of the M-4977 from the National 
© general Mental Health. It is a part of 
OC esse. Program of research on inferential 
вед Bee, in progress at the Behavior 
w ee The authors are in- 
Son Frederick J. Todd, Cameron 
ance, and Richard essor for their 


has called 
“T 


he 
take, < lese 


(b) the criterion partly, but not per- 


and (c) the criterion perfectly 


Activity of the Clinician.” To what 
extent this process enables human 
subjects to utilize nonlinear as well as 
linear relations has become a focal 
point in this issue (cf. Meehl, 1960, 
p. 24). For although test manuals 
and clinical teaching emphasize the 
nonlinearity of certain cue-criterion 
relations? there has been virtually no 
effort made to discover to what extent 
clinicians or other human subjects can 
effectively utilize such nonlinear rela- 
tions in making inductive inferences. 
Indeed, studies of clinical inference 
strongly suggest that it is no more 
than a linear cognitive process. Thus, 
Todd (1954), Hammond (1955), Hoff- 
man (1960), Grebstein (1963), Hursch, 
Hammond, and Hursch (1964), Ham- 
mond, Hursch, and Todd (1964) have 
found that linear (i.e., multiple re- 
models are sufficient to 
the process. Also, Newton 
d Lee and Tucker (1962) 
this to be the case in 
asks using college stu- 
dents as subjects. Anderson (1962) 
Ainsworth, Klopfer, and Holt 


(1954), for example, suggest that a nonlinear 
relation exists between W% on the Rorschach 
and organizational effort when they indicate 
that both high and low values of W% reflect 
“Tittle effort to organize experience [p- 3007." 


gression) 
account for 
(in press) an 
have shown 
quasi-clinical t 


? Klopfer, 


915 
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likewise reports high predictability of 
a linear, additive model in impression- 
formation studies. 


PROBABILITY LEARNING 


Consideration of the results of 
multiple-cue probability studies leads 
to a similar conclusion; studies by 
Smedslund (1955), Summers (1962), 
Uhl (1963), Rappoport (1963), Todd 
and Hammond (in press), and Peter- 
son, Hammond, and Summers (1964, 
in press) also show high linearity in 
the response system of subjects. 

It must be emphasized, however, 
that in the above studies of clinical 
inference and learning the task itself 
has involved primarily linear rela- 
tions; there were no compelling rea- 
sons for the subjects’ response sys- 
tems to be anything but linear. In 
view of Brunswik's (1956) admoni- 
tion that tasks should be representa- 
tive of a wide range of conditions, the 
performance of the subjects in situa- 
tions involving nonlinear as well as 
linear relations should be investigated 
before concluding that the process of 
inductive inference is primarily linear. 
In short, having discovered what ap- 
parently is a Strong tendency for 
human subjects to utilize the data 
from linear relations in a highly linear 
manner, it remains to investigate 
whether subjects utilize the data from 
nonlinear relations in a linear and/or 
nonlinear manner. 

We need not restrict ourselves to 
the question of whether human sub- 
jects can utilize nonlinear as well as 
linear relations, however; we may go 
so far as to ask whether humans can 
combine both kinds of data and utilize 
both linear and nonlinear cognitive 

processes in the same task. That is, 
can human subjects combine data 
from linear and nonlinear cue-cri- 
terion relations concurrently in such 
a way as to make appropriate use 
of both? To the best of our knowl- 
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edge, this question has never s 
investigated empirically. 16 pes 
investigation, however; even brie 
reflection suggests that it is a question 
which lies at the heart of a large 
number, possibly a majority, of clin- 
ical inference tasks, as well X ne 
ence tasks in general. For if the 
clinician always combines data in 4 
linear manner irrespective of the Jorm x 
the data, the only difference in ac 
racy of prediction between the ame 
ian and, say, a multiple regressio 
equation would lie in differentia 
approximation to optimum weighing 
of cues. Moreover, it would be u B 
reasonable to assume that inference 
required for adaptation to the ma 
ecology neatly separate themsei Md 
into two mutually exclusive kin a 

those with only linear relations а : 
those with only nonlinear relatio 
A first step toward an empiric 


. x Ж сеп 
analysis of this problem is undertak 
here. 


CONCEPTUAL AND METHODOLOGICAL 
FRAMEWORK f 
Fundamental to the problem Be? 
investigating the question of ne 
and nonlinear inference is LA d- 
choice of a conceptual and met hs 
ological framework, and (b) a yt 
of analysis, The purposes of hat 
article are two: To demonstrate e 
Brunswik's (1956) theory of Ue 
abilistic functionalism makes it pet 
sible to investigate the question vil 
to show that a method of ana л 
developed within that framew' 
makes it possible to increase 
knowledge of subjects’ abilte . 
Cope with linear and nonlinear inlY 
ur purposes, therefore, are m? 
methodological, ша 
Pace does not permit a гесарі 952s 
tion of Brunswik’s approach (1085, 
1955, 1956; see also Hammond, ik's 
for an application of Brunet em 
theory and method to the pr? 


5- 
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Achievement (r) 


Distal 
meo TORIS 
Estimate 


Cue utilization (rg |) 


QO ® © 


Ecological validity (rei) 
Cues (Xi) 


Fic. 1. Brunswik's lens model. 


gay inference). Figure 1, how- 
a shows how Grebstein's (1963) 
udy of clinicians’ efforts to predict 
4 from Rorschach protocols may 
im серне, in the Brunswik- 
the ens model. Thus re indicates 
correlation between cues (e.g. 
matey” a the variable-to-be-esti- 
согын О), апа r, indicates the 
cliniej ion between cues and the 
im an's estimate of IQ. Further- 
е, a multiple correlation coefficient 
id ur be calculated between cues 
brad Ы criterion variable, and a 
en pe correlation coefficient (R.) 
and Ке calculated between the cues 
the € clinician s estimates (У,) of 
iudi une Me (Y). Finally, ж 
оге] achievement, that. is, the 
^ ion between the clinician's 
&ment and the criterion values. 
ee ze above framework, Hursch 
analysis 964) made a mathematical 
^ Ee of the limits of achievement 
Yelp sh arious conditions. Their anal- 
A м that achievement (ra) is 
cluded by the components in- 
~in the following equation: 


non RÈ + R? —>4 


2 
+ Сү = RSA — Rê) [1] 


where 


та = the correlation between subject's judg- 
ment and the variable estimated 

R, = the multiple correlation between the 
cues and the variable estimated 

К, = the multiple correlation between the 
cues and subject’s judgments 

xd =the sum of the products (r4 — r4) 
(Be; — б) where re; = the correlation. be- 
tween cue; and the variable estimated, 
т. = the correlation between cue; and sub- 
ject's judgment, Be; = the beta weight for the 
correlation between cue; and the variable esti- 
mated, and &,; = the beta weight for the cor- 
relation between cue; and subject's response 
C — the correlation. between the variance 
unaccounted for by the multiple correlation 
in the ecology and the variance unaccounted 
for by the multiple correlation in subject’s 
response system 


One further point needs to be made 
concerning the above equation. It 
should be noted that the first term on 


the right 
R2+ К? — Ed 
2 
denotes the linear component of 
achievement (ra). The second term 
on the right 
ceva = RAA — К) 


denotes the nonlinear component of 
achievement. The sum of the twa 
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terms, of course, equals total achieve- 
ment expressed as za Thus the 
performance of subjects may be 
investigated to determine whether 
achievement is mainly a function of 
the linear component or the non- 
linear component.? 

When this method of analysis was 
applied to Grebstein's (1963) data, 
it was clear that because (a) R was 
rather high (.79), (b) the average R? 
for the clinicians was also high (above 
-90 for all 15 clinicians studied), and 
(c) the Zd value was rather small, 
the clinicians could hardly have im- 
proved their performance by attempt- 
ing to discover and utilize whatever 
nonlinear relations existed in the 
Rorschach IQ system (see Hammond 
et al, 1964, for complete details). 
When the above analysis was applied 
to Todd's clinicians, who were also 
attempting to predict IO from the 
Rorschach, it showed that the achieve- 
ment of these clinicians was hampered 
by attention to nonlinear character- 
istics of the task (see Hursch et al., 
1964). In theabove studies, however, 
response to nonlinear relations is a 
somewhat ephemeral notion. [ts 
statistical denotation is clear, but its 
empirical meaning is not, 

Statistically, it is the term C in the 
above equation which refers to non- 
linearity. The C indicates the correla- 
tion between the residual variance in 
the clinician's response system and in 
the test-criterion System, that is, the 
relation between the Variance not 
accounted for by the linear relations 
measured by R? and R3, Inasmuch 
as C reflects the appropriateness of the 
nonlinear or configural aspect of the 

clinician's response system, the value 
of this measure represents the only 
possible advantage the clinician may 

3 See Tucker (1964) for a detailed discussion 

of the analysis of linear and nonlinear com- 
ponents of achievement. 
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have over the traditional multiple 
regression equation, which, of course, 
describes only linear relations. | 
Empirically, however, the st) 
(C) is yet unclear. In the Todd (19 id 
and Grebstein (1963) studies, for E 
ample, the value of C does not tell u 
which of the several possible nonlinear 
relations in the Rorschach IQ system 
were correlated with which nonlinear 
relations in a clinician's response 5у5- 
tem. Therefore, an experiment 19 
called for in which it is not only bee 
sible to ascertain whether the suber 
can combine both linear and дөй on 
data, but also one in which it will te 
possible to analyze and evalua a 
quantitatively the subject's ren 
dependence on both linear and ua 
linear cue-criterion relations whe 
they are under experimental а 
Such an experiment and its resu 
will be described below. y 
The present study deals with a и 
Simple two-cue situation ae 
both linearity and nonlinearity: Lac 
task involves the following propertie 


1. Criterion partly, but not p 
fectly, predictable from either js ; 
dictor taken alone (cf. Horst, 195%) 
Meehl, 1950); 

2. One predictor (X1) 
related to the criterion, 
dictor (X.) nonlinearly related 
criterion ; le if 

3. Criterion perfectly predictabe g 
probabilistic cues combined accor Хі 
to the appropriate rule: Y = 
+ sin X,.4 R in 

4. Fifty percent of the varia at 
the criterion determined by the 10 
Predictor, 50% of the variance 1” 


linearly 


re- 
ne P 
о to the 


ase 
" Е phas! 
‘Sin X, is ап approximation to оле апі 
of a sine curve. Using whole пише! 0 


restricting the range of transformation 


the 
ma u 
digits, t 


a „ аг : 
g he correlation between X2 шо! 
criterion is not exactly .00, The contri igible* 
of this cue to the Кё, however, is neg > 
approximately 01, 
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criterion determined by the nonlinear 
predictor. 


The first step was to ascertain the 
empirical validity of the components 
of the equation set forth above, with 
Particular reference to C, the measure 
of nonlinear relations between the 
task system and the response system. 
For this purpose, an analysis of the 
Performance of hypothetical subjects 
who pursue various types of response 
Strategies was carried out by com- 
Puter simulation (see Table 1). 
ee it is doubtful that in- 
eri uals ever follow specific inference 
thes egies with complete consistency, 
ien е simulated strategies serve to 

елпеа{е the components of a given 
Strategy, as well as its predictive 
н sone i Thus, perfect utilization of 
eee the linear or the nonlinear cue 
So% p. account for approximately 
ir ы the criterion variance; higher 
utilizes bo possible only if the subject 
Pa S both types of relationships. 

г instance, an achievement (ra) of 
weight паш even if the cues are 
ins e incorrectly, provided the 
бг а nonlinear specification is 
ван, Achievement. of 1.00 is 
Gee ee only if the statistical proper- 

E the task are matched perfectly. 
бу yen marked out some ideal 
Ypes of performance and shown the 


s TABLE 1 
IN " 
SUAM ED RESPONSE STRATEGIES AND 
апо. RESULTANTS (FOR THE 
ASK Y, = X, + SIN Xs, AND 


te = 11, freq = —-11, 
= Rè = .51) 
ES = 
Resultants 
Tet тез R? [^ ға 
1.00 00 | 1.00 .00 71 


:55 .32 | 1.00 .00 .64 
71 лі 1.00 .00 42 
.95 | —.05 .90 | 1.00 .89 
71 1 -51 1.00 | 1.00 
32 4 .12 | 1.00 .89 
.00 | —.15 .02 | 1.00 71 
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empirical meaning of the statistic C 
as well as other components of 
achievement, we now turn to the 
question of how the subjects will 
perform. In order to approximate 
roughly three possible approaches to 
clinical training, the subjects were 
studied with the following questions 
in mind: 


1. Can subjects learn to predict 
the criterion in a task of this nature 
having no prior information regarding 
the properties of the task? 

2. Can subjects learn to predict 
if given the ‘theoretical’ structure 
(linearity and nonlinearity) of the 
task? 

3. Can subjects learn to predict 
if they are instructed«as to how the 
“theoretical” structure applies to the 
specific task? 


PROCEDURE 


Subjects were given a two-cue inference 
task having the statistical properties described 
above. Stimulus materials consisted of 100 
3} X 6 inch cards divided into 3 blocks of 
20 cards each. Printed on every card were 
two 10-point “test scales" with the correct 
criterion value (hypothetical trait) on the 
back of each card. Instructions were focused 
upon providing different types of information 
as implied by the three learning conditions 
described above. Thus, Group I was told 
simply to make inferences on the basis of the 
two test scores and that accurate prediction 
was possible. The subjects in Group II were 
told that both linear and nonlinear relation- 
ships were involved and that the highest 
accuracy possible was contingent upon utiliza- 
tion of both. Examples of linear and non- 
linear prediction were provided. (The subjects 
in Group III were also provided with informa- 
tion regarding linear and nonlinear relation- 
ships; in addition, the linear and nonlinear 
cues were identified. All subjects observed 
the correct criterion value following each 
prediction. For half of the subjects in each 
group, the lincar cue was on the right; for the 
other half, the linear cue was on the left. 


RESULTS 


“Ichievement.  Vigure 2 shows the 
mean achievement correlation for each 
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Achievement (in Z torm) 


em Group 1 
e=. Group E 
— Group ar 


5 


! 2 3 4 
Blocks of Trials 


Fic. 2. Mean achievement correlations (in 


z form) for each condition over blocks of 
trials, 


experimental group over the 5 blocks 
of 20 trials. Achievement was quite 
high for Group III (£&, = .85), some- 
what lower for Group II (® = .71), 
and lowest for Group I (Z., = .49) on 
the final block of trials. Thus, as one 
would expect, the subjects given the 
most information (Group III) per- 


< .05) than the 


Group I 
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TABLE 2 


ANALYSIS OF VARIANCE OF MEAN 
ACHIEVEMENT CORRELATIONS 
(тх 5 SCORES) 


Е 
Source of variance | df | MS | 
7 .59* 
Instruction conditions (A) 2 gan 1 E 
Error (between) 27 1 451 5.88** 
Blocks of trials (B) 4 | 104. 5-98 
AXB 8 | 2188 
Error (within) 108 17.75 = 
*p < .05 
эсш 


performance over all groups (P < oe 
although Group I contributed least x 
this increase. There was no MR 
that one group learned more гар! i 
than another, since the intera a 
between groups and blocks was Lais 
statistically significant, The ict 
in Figure 2 suggest that learning 18 
reached an asymptote by the end. А 
the 100 trials, with a clear separatio 
of all three groups. The 
Differential Cue Dependency. . ed 
extent to which the subjects wr 
the linear (X;) and nonlinear (sin X? 
cues is illustrated in Figure 3. tes 
Wilcoxen signed-ranks test indican 
that the increase in the dependenct 
the linear cue from Block 1 to Bloc 


—— 
P d d 
а NE / 
/ 
[4 
/ 4 
З imal 
"4 жарыгын pee 
эы. у 2. sin Xg (NO 
= Linear) 
5 
! e 3 4 51 2 3 Ы 
Blocks of 


Fic, 3. 


Utilization coefficients for linear cue (Xx 


Trials 


1) and nonlinear cue (sin X2). 
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15 significant (р < .02) only in Group 
II, and the increase in dependence 
upon the nonlinear cue from Block I 
to Block V is significant (р < .01) 
only in Group III. Thus, the nature 
of the information given the subject 
determines whether the linear or non- 
linear cue will be increasingly utilized 
during learning. The graph in Figure 
3 also indicates that there is a much 
stronger dependence on the linear cue 
than on the nonlinear cue in the mini- 
mum information condition (Group 
I); the curves are more widely sepa- 
rated here than in any other condition. 
Note also that the utilization of 
both cues does not reach optimality. 
Perfect achievement would require 
that both curves reach the s = .89 
(r= .71) line at the top of the graph. 
Overall, there appears to be a tend- 
епсу to depend more on the linear cue. 
Total Linearity. Figure 4 shows the 
mean linearity (2,2) of the subjects in 
each group. Clearly the different in- 
oe did not differentially affect 
пе extent to which the subjects com- 
bined the cues in a linear manner. 
1 Or was the increase in R, from Block 
i Block V statistically significant 
т any group. Figure 3 indicates 


70 
e—.—e Group I 
+———+* Group I 
50 + e Group Ш 
os * Optimal 
Qo Linearity 
& .50 
= 
5 -40) 
v 
= 
4 30) 
c 
Ы 
£ 20 
10) 
00! 


і 2 3 4 5 
Blocks of Trials 


Fig 
з. 4, К s 
Mean response system linearity for 


cach condition over blocks of trials. 


1.20, 


Z Scores (C) 


*—.—* Group I 
+-——+ Group II 
— Group ш 


Blocks of Trials 


Fic. 5. Mean C correlations (in z form) for 
each condition over blocks of trials. 


that in Group I a greater proportion of 
the total linearity (R,?) may be at- 
tributed to linear utilization of the 
nonlinear cue than in either of the two 
other groups. 

Most important is the fact that the 
linearity of the subjects is less than 
optimal, and the absolute value of the 
mean R,? is quite low. 

Accuracy of Nonlinearity (C). Fig- 
ure 5 illustrates the differential re- 
sponse to the nonlinear aspects of the 
task in the three groups. Group III 
clearly developed the largest value 
of C. The difference between the 
effect of instructions was statistically 
significant (p < 05; see Table 3). 
Trials also had an effect on the extent 
to which the subjects correctly utilized 


TABLE 3 


ANALYSIS OF VARIANCE OF MEAN C 
CORRELATIONS (IN 2 Scores) 


Source of variance df MS F 
Tnatructionconditions (4) | 2 | 726.95 | 420* 
Error (between) 27 | 17282 
Blocks of trials (B) 4 | 13804 | 6.71" 
AXB в | 2154 | 1.04 
Error (within) 108 | 2055 
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the nonlinear cue, there being a 
significant (р < .05) increase in the 
value of C over all groups. 

Linear and Nonlinear Components of 
Achievement. Figure 6 shows the 
relative contribution of the linear 


| 
2 


and nonlinear 


[ceva = Raa — R2)] 


components of achievement (re) for 
each group.* Appropriate weighting 
of the cues in this task would re- 
quire that each component contribute 
equally to achievement. Inspection 
of these curves indicates that Groups 
П and III are clearly superior to 
Group I in adapting to the equal con- 


tributions of the two components of 
the task. 


Discussiox 


The issue of additivity Versus pat- 
tern plays a central role throughout 
the history of psychology. 


) Because 
psychologists have Persistently taken 
an either-or posture toward this 


matter, it was only natural for them 
to take an either-or posture toward 
the linearity versus nonlinearity issue, 
as they have toward the clinical 
versus statistical prediction issue. All 
three are merely different Versions of 
the same problem. Yet Psychological 
experiments have shown that humans 
can solve problems of inductive infer- 
ence in either an additive (linear) or 
patterned (nonlinear) manner, 

It is possible to demonstrate pure 
cases of either process, however, only 
in very peculiar circumstances such 


5 In order to expre in z score form, the 
two components in such a way that they add 
to produce ra, also in z score, a transformation 
to z for each component was based on the 
proportion each contributed to Ya in raw score 
form. 


Kenneta К. HAMMOND ann DAVID A. SUMMERS 


as psychological experiments. While 
laboratory demonstrations of either 
process in pure form may be of some 
interest, the very peculiarity of the 
circumstances necessary to elicit them 
makes their relevance dubious. Those 
who wish to investigate the AT 
capacity for inductive inference WH 
not wish to restrict themselves to 
conditions which evoke either p» 
process or the other. More to e 
point, we believe, is the analysis ai 
inference in situations where both 
linear and nonlinear cognitive proc- 
esses can contribute to problem solu- 
tion. What is needed is the investiga 
tion of the human's capacity to bring 
both processes to bear on problems 
which require both for appropriate 
functioning, t 

The specific results of the presen 
experiment indicate that achievement 
can be quite high in a task of this ane 
but that achievement is contingen 
upon information about the task. 
Such information also determines 
whether the linear cue or the ida 
linear cue will be increasingly utilize 
during learning, І йе 

The mean linearity (R2) of t 
subjects was less than optimal y^ 
quite low, ranging from .21--42- 
This finding is important because 
although results or previous studies 
indicate a high degree of linearity z 
the part of the subjects in multiple 


pensity for a highly linear, additive 
is contingent i 
the subject being presented with 4 
x system. — 
. 1€ results support the empirica 
significance of C (see Equation 1) = 
a measure of the relation betwee! 
nonlinear variances in the task 
Tesponse systems of the subject. Fue 
C increases ag a function of both ие 
structions and trials in a situation # 
which the nonlinear cue is known a” 
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140, 
Group I Group II Group III 
—" 
гро *——-—* Linear ? gee” 
Component 
*—-—* Non-Lineor 
1.00| Component 
H 
N 
c 
5 . p 
2 7 р 
" F a 
Bre T "d 
“7 d 
e= "362 "4 
pP d 
7</ 
T 
oe a a (шшш ый — 
2 3 4 51 2 3 4 5 
Blocks of Trials 


Fic. 6, Achievement correlations 


and component scores for each condition 


over blocks of trials. 


ee the experimenter, a result 

previously available. 
aoe the results indicate that 
e relative contributions to accuracy 
of the linear and nonlinear compon- 
med E performance were about equal, 
Sisi od in the minimal information 
т ion, where the main contribu- 
йен achievement сате from the 
ea In view of the fact 
mds linear and nonlinear com- 
bone c: the task itself were equally 
See this result provides un- 
Bend and impressive evidence of 

ive adjustment. 
eo results of current studies 
plane, ers & Hammond, 1964) em- 
ead е with an imbalance be- 
Ponen inear and nonlinear com- 
ef ah 5 шо point toward matching 
Soman, inear and nonlinear com- 
MAN of achievement with the 
Pective components of the task. 

apu tenen of all the above 
restrict дйн must, of course, be 
the Ph iy to the specific conditions of 
йат k; only one linear and one non- 
cue was present, and the non- 


linear function chosen is only one of a 
multitude of nonlinear functions con- 
fronting the human problem solver. 
(See Hoffman, 1960; Meehl, 1954, 
for discussion of possible nonlinear 
relations representative of clinical 
inference.) In addition to number of 
cues and type of nonlinear function, 
type of feedback (statistical as Op- 
posed to the traditional outcome 
feedback employed here; see Newton, 
in press; Todd & Hammond, in press) 
and amount of linearity (К; see 
Hursch et al., 1964) are certainly rele- 
vant dimensions along which investi- 
gation should be focused. lthas been 
the purpose of this paper to demon- 
strate that such investigations are 
possible within the conceptual and 
methodological framework of Bruns- 
wik’s probabilistic functionalism and 
that the necessary analytical tools are 


available.* 


computer program for 
atistics discussed in this 
on request at the Insti- 
Science, University of 


sA (Fortran) 
calculating the st 
article is available up 
tute of Behavioral 
Colorado. 
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It is believed by many that the neural engram undergoes a consolidated 
phase after a learning trial. During this phase the engram is particu- 
larly susceptible to disruption by traumatic incidents. A review of the 
literature on electroconvulsive shock (ECS) provides little, if any, 
support for this point of view. ECS does, however, result in retro- 
grade amnesia (RA). If the RA is not due to disruption of a 
consolidation process, then what are its sources? The writers in- 
terpret the literature to mean the ECS produces an inhibition of the 


Pavlovian variety whic! 


h follows the known laws of learning and 


becomes conditioned to stimuli of the ECS situation. This point of 
view is shown adequate to explain much of the data produced by ECS 
studies and is consistent with other interpretations of the effects of 


massive stimulus discharges. 


and" few psychologists of any theo- 
Diner persuasion would deny that a 
Been d trial produces some kind of 
ЖОШ. їп {һе nervous system. Most 
Kao agree that this change must be 
rel ctural one to account for the rela- 
lavi) permanent modifications of be- 
n n constitute learning. How- 
Pod there has been considerable dif- 
this ce of opinion about the nature of 
S residue of experience—the 
жыз m. Most physiological psycholo- 
а reg those physiologists and bio- 
in e d who have shown an interest 
em ae d seem to prefer an engram 
1961) ation interpretation (Glickman, 
tion am The consolidation interpreta- 
Several ifficult to describe, for there are 
Баа 3 versions of it, and none of them 
de ееп elaborated with any great 
rog of specificity. 

ies p» ee theorists seem to 
696. at immediately following а 
tively t trial the engram is in a rela- 
Susce e condition and is therefore 
tion P 2 to impairment or eradica- 

+ If it is impaired during this 


1 
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period, memory loss or forgetting oc- 
curs. In a short periód of time follow- 
ing the learning trialj however, the 
engram consolidates, becoming a rela- 
tively permanent residue of the learning 
experience and resistant to destruction. 
We need not at this moment be con- 
cerned whether the engram is molecular 
in nature, or cellular, electrical, or 
chemical, or any of the other many 
possibilities. The major notion that 
concerns us here is that the engram has 
an initial phase in which it is vulnerable 
and easily destroyed. 

The consolidation theory of learning 
was first proposed by Miiller and Pil- 
zecker (1900) in order to account for 
the forgetting of verbal materials. 
They knew that a list of verbal mate- 
rials interpolated between the learning 
of an original list and its recall would 
result in interference with the retention 
of the original list—the phenomenon 
of retroactive inhibition. Müller and 
Pilzecker hypothesized that the en- 
grams produced by the recitation of the 
items from the original list were not 
yet consolidated and thus were sus- 
ceptible to disruption produced by re- 
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citing items from the interpolated list. 
Although widely accepted for some 
years by experimenters on verbal 
learning, the consolidation theory has 
few adherents among this group today. 
Too few of the variables manipulated 
by the verbal learners show the tem- 
poral effects demanded by the con- 
solidation theory. 

The phenomenon of proactive inhibi- 
tion, for example, is difficult to encom- 
pass within a consolidation framework. 
Retention of verbal units has been 
shown to depend upon the nature of 
the list of items previously memorized 
(Underwood, 1957). Here we have a 
memory deficit brought about by pre- 
ceding events—by events occurring be- 
fore the second-list engram was in 
existence. Also the degree of simi- 
larity among lists of items has been 
Shown to have a great effect upon re- 
tention. For these and other reasons, 
many workers in verbal learning (e.g., 
Postman, 1961; Underwood & Post. 
man, 1960) espouse a competition 
theory of forgetting, The competition 
theory maintains that the learning of 
one response does not destroy or oblit- 
erate another. The “unlearning” that 
is said to take place is temporary and 
transitory (Postman, 1961) аз indi- 
cated by the spontaneous recovery of 
what was unlearned and by the Savings 
with which the unlearned may be re- 
learned. The possible degree of con- 
solidation of a neural engram is not 
involved in such a conceptualization, 

One of the most important consid- 
erations in evaluating a consolidation 

hypothesis is the length of time during 
which consolidation may be presumed 
to take place. Many authorities be. 
lieve that this time period is quite 
short. Pearlman, Sharpless, and Jar- 
vik (1961), for example, say, “It is 
agreed that the interval during which 
such agents can exert significant retro- 
grate effects is relatively short, a few 
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hours at most [p. 109]." Deutsch 
(1962) places it at about 1 hour Mc 
John (1961) reduces it to between 4- 
hour. Most experimental tests of the 
consolidation hypothesis have involved 
the manipulation of variables intended 
to speed up or slow down the fixation 
of the engram. The theory demands 
that these variables must have a greater 
effect when manipulated soon after in 
learning experience than when this 15 
done some time later. Variables ad- 
ministered a considerable time after 
a learning experience—say 12 or € 
hours—but which nevertheless rest! 
in a memory deficit cannot be con- 
sidered as producing data germane tO 
à consolidation notion. d 
Glickman (1961) recently evaluate? 
the evidence relevant to the consolida: 
tion theory, and it was his opinion Pe 
"experimentally induced RA t 
grade amnesia) has produced the Les 
evidence for the existence of a conso | 
dation process since the results won B 
be predictable from perseveratio! 
theory, while the primary competing 
theory, the associative interference 
theory, has no explanation to offer [Р 
220].” Glickman states that the “ga 
jority of the evidence supporting = 
consolidation hypothesis comes fro k 
Studies in which electroconvulsive eer 
(ECS) is used to produce RA, and Т 
is this body of data that will be ОЧ 
main concern here. Our approach e 
be to take up the major experimen 
variables in which ECS has been ! 
volved and to evaluate the data "E 
from their manipulation insofar 25 re 
pertains to consolidation, Are qe 
adequate data to substantiate the " 
tion that a period of time follow? 
learning is necessary for an engran 
become fixed? :on ap" 
Without adopting a consolidation * 
Proach, one can still inquire into the 
Physiological processes underlyin Sjon. 
phenomena of proactive inhibi 
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Certainly physiological events do take 
place and, whatever they are, they must 
take place in time. The consolidation 
hypothesis is distinguished by the no- 
tion that the learning engram requires 
a period of time to become fixed and 
that this time period is long enough 
to permit external events to interfere 


~ Physically with fixation, but still not 


so long as to endure almost indefinitely. 


Tue Learninc-ECS INTERVAL 


The interval between the end of a 
learning trial and the ECS is a crucial 
One because it is during this period that 
the engram is presumed to consolidate. 
Duncan (1949) manipulated this vari- 
able in a classic ECS experiment that 
gave support to a consolidation theory. 
Using a shuttle box, he gave his sub- 
Jects one trial a day followed at dif- 
ferent time intervals by ECS. The 
intervals between the learning trial 
NG the ECS for the various groups 
tee 20 seconds, 40 seconds, 4 min- 
m. 15 minutes, 1 hour, 4 hours, and 
dà m He found that the relation- 
"^ B etween the number of successful 
t Oidance responses and the time in- 
*rvals was sharply negatively acceler- 
ын , There was no evidence of learn- 
inn with the shortest interval, indicat- 
e total RA, but if 1 hour or more 
Pe pid between the end of a trial 

the convulsion, there was no ap- 
parent memory loss. 


^ Gerard (1955) administered ECS 


at oe of 5 minutes, 15 minutes, 
band. and 4 hours following maze 
the ч Не found no learning at 
Nd hortest interval and no impair- 
at the longest. Thompson and 
xe (1955) used a choice situation 
wae a single ECS either 10 sec- 
о 2 minutes, 1 hour, or 4 hours 
eas guns of acquisition. They 
val no learning at the shortest inter- 
and no impairment at the longest. 


Heriot and Coleman (1962) used a 
one-trial avoidance situation in which 
the subjects were taught to avoid a 
bar which they had previously learned 
to press. One ECS followed the 
avoidance experience at either 1, 7, 26, 
60, or 180 minutes. The three shortest 
intervals obliterated the effects of the 
avoidance training, the longest inter- 
val had no effect, and the 60-min- 
ute interval was intermediate. These 
studies, a representative sample of a 
number of others, are all consistent in 
showing retention to be a negatively 
accelerated function of the interval be- 
tween learning and the administration 
of ECS. None of them indicates that 
ECS is effective with an interval as 
long as 4 hours, and they all can be 
considered to fulfill the assumptions 
which support a consolidation hypothe- 
sis, although there are other equally 
tenable explanations of these data. 
There are a number of studies, how- 
ever, which are not supportive of a 
consolidation hypothesis in that they 
have involved the administration of 
ECS at greatly extended intervals fol- 
lowing the last learning trials. Most 
of these also show that ECS results in 
some impairment of a previously 
learned response. Carson (1957), for 
example, gave a series of ECSs, two a 
day over a 20-day period, which began 
24 hours following the last learning 
trial, A marked deficit due to ECS 
appeared, and the deficit lasted over a 
period of 15 days at which time the 
experiment ended. Williams x 1963) 
gave ECS to one group of subjects 24 
hours after the completion of training, 
and to a second group 13 days later. 
The response, à conditioned emotional 
response (CER), was greatly attenu- 
ated by the ECS following traming by 
one day and was still slightly effective 
by the ECS given 13 days later. 
“Geller, Sidman, and Brady (1955) 
allowed 3 days to intervene between 
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training and ECS and found that the 
learned response, a CER, nevertheless 
had disappeared as a result of the con- 
vulsive treatment. Very significantly 
the CER had returned when the sub- 
jects were retested 30 days later. The 
attenuating effect in this case was not 
the permanent one that should have 
resulted if an incompletely consolidated 
engram had been destroyed. Previ- 
ously Brady (1951) had given ECS 30, 
60, and 90 days following the learning 
of a CER. He found no immediate 
effect on the retention of the CER, al- 
though the three groups did align 
themselves regularly on a resistance to 
extinction response measure which 
would indicate some attenuating ef- 
fect for the very long intervals used. 
Stone and Bakhtiari (1956) gave 40 
acquisition trials on а 13-choice multi- 
ple-T water maze with a 24-hour inter- 
trial interval. When ECS was ad- 
ministered 30 days following the last 
acquisition trial (70 days after the first 
trial), a deficit appeared temporarily 
on a subsequent retention test. but the 
deficit disappeared with further train- 
ing. 

Braun, Patton, and Barnes (1952), 
using monkeys, gave 515 object-quality 
discrimination problems on the Wis- 
consin General Test Apparatus which 
took approximately 56 days to com- 
plete. By the end of this experience 
the subjects had developed a highly 
efficient learning set and could learn a 
new problem much quicker than they 
did at the beginning of training. Seven 
days after the training was over they 
were given a series of 20 convulsions, 
3 а week. When retested 3 days 
after the last convulsion, the treated 
animals were markedly inferior to con- 
trols, and this inferiority lasted over a 
series of 96 more problems given 8 
a day. In this experiment ECS was 
given 63 days after the completion of 


all learning trials. This is an enor- 


mously long time for consolidation to 
continue. 

We have seen that ECS interferes 
with a previously learned верасе 
when it is administered as long as 3 
or even 60 days following the conclu- 
sion of learning. We have also seen 
that the deficit brought about by ECS 
can also be somewhat temporary. 
These two facts make difficulties for а 
consolidation theory. They anie 
that retrograde amnesia can be brough 
about by ECS even when the engram 
associated with original learning en 
completely consolidated —assuming tha 
engrams do consolidate. We will re- 
turn to this point later. 


ECS—LEARNING INTERVAL 


The data of proactive inhibition were 
à source of great difficulty for those 
holding to a consolidation point is 
View as an explanation for the forge 
ting of verbal materials. Тһе c 
difficulty seems to prevail using m is 
Poschel (1957) first gave his cwn 
approach training in a runway. Ther 
they received a series of 10 ECSs T 
2 days. Next they were placed in mE 
end box of the runway and given h 
nonconvulsive shock to the feet aie 
а grid floor, Seven days later tes 
Were made to determine if they wou 
tend to avoid the end box where they 
had received foot shock. The апіта ё 
Who had received ECS showed a 
avoidance of the end box although ч 
avoidance was clearly present in o 
trol subjects who did not receive Ё at 
The ECS, given before the рипівћте 
in the end box, interfered with t 
earning of the avoidance response. 

Adams and Lewis (1962a) use ich 
two-compartment shuttle box in whi st 
animals learned to run from the jo 
compartment into the second Decal. 
of a grid shock to the feet in the ПА 

efore this shuttle training, one E Cos 
of subjects was given a series of E 
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in the first compartment of the shuttle 
box. Then, 3 days later, they were 
given avoidance trials. The animals 
given ECS had much greater difficulty 
learning the avoidance response than 
did another group which had not re- 
ceived ECS first. How can an engram 
be interfered with 3 days before it 
Occurs? There is little possibility that 
the response interference could be due 
to the postconvulsive confusion re- 
Ported for human patients since such 
Confusion lasts very few hours at most 
(Krains, 1957, p. 466). 

Leukel (1957) used a 14-unit multi- 
Dle-T water maze in which rats received 
One trial a day for 10 days. Either 1 
азна 5 minutes, 30 minutes, or 2 
meee after a daily learning trial they 
4a m ECS. At the conclusion 
fe Ody acquisition ECS phase 
in um jects were given further trials 
Šen i water maze, one a day, until they 
= ched a criterion of three successive 

rorless trials. There followed a 30- 
= idle period and then more trials to 
eni criterion in order to measure re- 
owed The initial acquisition data 
Bionn a response depression for all 
MS independent of the trial-ECS 
at a finding not helpful to the 
sui idation notion. A more inter- 
int 8 result of this study for our pres- 

Ё а ы was that the response 
mained р resulting from ECS re- 
ing " through both the initial relearn- 
der P and the 10 retention (and 

his ‘ning ) trials given 30 days later. 
* и further evidence that the effects 
ing CS can interfere with later learn- 
eff« le., proactively; in this case the 

Ct of ECS remained and interfered 


wi х 
а Telearning as much as 30 days 


Spacine or ECS TRIALS 


7 he prediction of a consolidation 
Ty regarding the spacing of ECS 


*Xperj 
seems clear enough. The 
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closer ECS follows a learning trial 
and the more compact the series of 
convulsions, the greater should be the 
interference of ECS with an engram. 
An experiment by Brady, Hunt, and 
Geller (1954) brings some information 
to bear on this question. 

After their animals had acquired a 
bar-press response they were presented 
with a clicking noise which was fol- 
lowed by shock to the feet. After a 
few trials of this sort, the subjects 
stopped pressing the bar (CER or con- 
ditioned emotional response) when- 
ever the clicking occurred, although 
they would continue to press otherwise. 
Twenty-four hours after the CER was 
well developed, all groups of subjects 
received 21 ECSs, but with a different 
time interval between treatments for 
each group. One group received ECS 
every second, and other groups were 
convulsed at intervals of 30 minutes, 
1, 8, 24, or 72 hours. The very inter- 
esting results showed virtually com- 
plete attenuation of the CER in the 
groups receiving ECS at 1-, 8-, or 
24-hour intervals. ECS given every 
second, at 30-minute or 72-hour inter- 
vals did not produce as much effect. 
Some very complicated assumptions 
could be made about the course of 
consolidation in order to preserve the 
theory, but they would be so specific to 
this experiment that their value would 
be doubtful. 


SPACING OF ORIGINAL 
LEARNING TRIALS 


Thompson and Pennington (1957) 


gave intertrial acquisition intervals of 


45 seconds, 2, 3, 4, 5, and 6 minutes on 
a visual discrimination problem and 
continued learning trials until an ac- 
quisition criterion was attained. ECS 
followed immediately. Their results 
indicated that ECS had less effect with 
the longer intertrial intervals for origi- 


nal learning. The experimenters at- 
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tributed their results to the increased 
time within the widely interspersed 
acquisition intervals permitting neural 
consolidation to occur. They say, 
“This follows from the theory if it is 
assumed that the amount of consolida- 
tion of the memory trace increases as 
a function of the duration of persevera- 
tive activity. Thus, a long intertrial 
interval allows the posttrial organic 
events to produce greater consolidation 
than a short intertrial interval [p. 
401].” | 

An implication of this argument is 
that engrams even from identical train- 
ing trials interfere with each other 
when they follow closely one after an- 
other. The evidence, however, is quite 
strong (eg. Kimble, 1961) that the 
effect of massed acquisition trials is a 
temporary one and therefore due to 
performance variables rather than 


learning ones. This militates against 
a consolidation interpretation, 


ExtINctIon or RA 


Adams and Lewis (1962b) showed 
very clearly that RA due to ECS need 
not be permanent, They showed, fur- 
ther, that the RA could be experi- 
mentally removed. Two groups of 
animals were given a Series of acquisi- 
tion trials in a shuttle box with each 
trial followed immediately by a convul- 
sion. The experimental Subjects were 
then returned, two at a time, to the 


start compartment of the shuttle box 
and left there for 5 minutes on 5 suc- 
cessive days. The other group re- 
mained during this time in their living 
cages. Then both groups were given 
reacquisition trials. The animals that 
had had extinction experience in the 
shuttle box relearned much faster than 
the controls. This indicates that the 
RA produced by ECS is not perma- 
nent, and that recovery can be con- 
trolled. 
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EMOTIONAL RESPONDING 


There have been a host of studies 
which seem to indicate that ECS po 
a special antagonism for ап тошаЬ re 
sponding. The first clear demons 
tion of this was made by Hunt P 
Brady (1951), and their series ied 
researches in this area has contribu s 
considerable information relative pis 
consolidation theory. Brady md s 
colleagues used approximately the 5 нн 
experimental arrangement, which fon 
already been mentioned briefly, The 
most of their convulsive studies. Ta 
subjects were first taught to press THEN 
on a variable-interval schedule. з doit 
they were presented with a brie vid 
associated with a shock through. ag " 
floor. After a few trials of this c 
the animals stopped responding as $ 


to rê- 
as the tone sounded and returned ihe 


a conditioned emotional 
(CER). Three or 4 days after 
last conditioning trial, and dae 
CER is strong, the subjects prp 
a series of ECSs, Brady and his s 0 
leagues typically administer a serie for 
21 ECSs at the rate of 3 a бау. ет 
7 дауз. Then, typically, 2 days ined 
the last ECS, the subject is retu ne 
to the Skinner box and the conditio ex- 
stimulus (CS) presented. Тһе ive 
perimental animals, those that geen 
the ECS, continue pressing at 2 the 
the same rate as they did Lagi | 
Conditioning of the CER. The “ately 
animals stop pressing imime These 
upon presentation of the CS. es 
findings indicate that ECS EN 
attenuate a CER but that it aon 
discernible effect upon the bar tonal 
Tt would seem either that ar М to 
responses are particularly sensit! re 
the effects of ECS, or that the bar P 

is not. 
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" There are at least two features of the 
р rady et al. (1954) studies having un- 
avorable implications for a consolida- 
Bip a First of all, there is the 3- 
ui, e between the end of 
dim meri and the first administra- 
ы CS. This is a long time for 
eae bi still to be consolidating. 
= ‚ ECS has no clear effect оп the 
аг press under Brady's conditions. 
ere interesting experiment by 
dis y( 952), using the standard para- 
gm described above, shows that the 
oe recovers to some extent 
ECS ex days after the cessation of 
that Pertence This can only mean 
tion ^ effects of ECS, in this situa- 
"T transitory. Brady has also 
ECS К, aS we have seen, that when 
lot Ә 15 given 30, 60, ог 90 days fol- 
m. the conditioning of a CER, the 
үл no attenuation on а reten- 
the cee en 4 days later. When 
Silha Y was subsequently extin- 
on des however, the rate of extinction 
eias бон to the learning ECS 
Rushed i the 30-day subjects extin- 
ies Ev followed by the 60- and 
Be Poles jects in that order. Again 
ven arning-ECS interval is important 
though the shortest interval is 
days, 
Nu Jernberg, and Brady (1952) 
Коз e. that if extinction is given 
Cover a the CER will not re- 
» imas hough the CER will recover 
; Ing ECS if no extinction period 


is а 
oe A careful look at the “close 
et а], of responding in the Hunt 


CER 0952) study revealed that the 
eas never totally obliterated by 

ata CS but was only greatly attenu- 
a Probably extinction had its ef- 
Pon this weakened CER. This 

ra сш that a process similar 
“able to aHeous recovery may be appli- 
haps it attenuated CER, and per- 
Ne eff, also points to the similarity of 

cts of ECS and extinction. 


feet 
Stud. 
D s 
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Thus, Brady's data show that when 
a CER is superimposed upon an instru- 
mental response the ECS can attenuate 
the CER, leaving the instrumental re- 
sponse undisturbed. This finding has 
been duplicated by a host of investi- 
gators using quite a large number of 
experimental situations, and there 
seems to be absolutely no question 
about its reality. Heistad (1955), for 
example, trained animals to tilt a box 
which was balanced on a fulcrum. 
Tilting the box allowed water to flow 
into a trough and the thirsty animals 
could then drink. After the instru- 
mental response was well acquired, a 
CER was instilled by means of shock 
through a grid floor in the apparatus; 
the animal now crouched and huddled 
instead of tilting the cage for water. 
Two days after the last CER trial 
the experimental animals were given a 
series of ECSs. When put back in 
the apparatus, the animals returned im- 
mediately to tilting the cage for water; 
the CER had been almost completely 
obliterated. 

Williams (1961) first taught animals 
to approach food in a runway. Then 
they were shocked through a grid in 
the floor of the last section of the 
runway to produce a conflict demon- 
strated by “hesitations” as the animals 
approached this section. A series of 
ECSs commenced 1 day after the last 
conflict trial. The effect of the ECS 
was virtually to obliterate the hesitation 
without interfering with the overall in- 
strumental running response. In an- 
other study, Heriot and Coleman 
(1962) trained subjects to press a bar 
in a Skinner box. Then they were 
given a single strong shock upon touch- 
үг which generated avoidance 
of the bar. ECS was given a short 
time following the bar shock, and when 
the animals were returned to the box 
they began to press the bar again; 
avoidance was gone but not the bar 


ing the be 


232 


press. Madsen and McGaugh (1961), 
in an experiment similar to that of 
Pearlman et al. (1961), placed their 
subjects on a small platform suspended 
over a charged metal plate. When the 
animals stepped off the platform, they 
received a shock to the feet through 
the metal plate. Their Tesponse was 
to withdraw from the edge of the 
suspended platform and not to step off 
it. They were then given a single 
ECS and returned to the platform 
upon recovery. Now they stepped off 
the platform again as if they had never 
received shock through the metal plate ; 


again the CER was selectively affected 
by ECS. 


Tesponse is also at- 
tenuated by ECS, Heistad (1955) 
trained animals to move from one com- 
partment to another in a circular maze 
when a CS was presented. 
moved soon enough 
ance of the CS 
delivered through the grid floor. After 


was 
als were given a series 
Е h effectively reduced the 
avoidance response. In this study the 
active muscular Tesponding—the avoid- 
ance—was attenuated by ECS, This 
15 In contrast to the Br: 


Carson (1 
fashion attenuated а w 
avoidance response by ECS 
host of other studies (e.g., Duncan, 


ECS. Tt 
may be that both the instrumental re- 


sponse and the emotional response have 
been interfered with in these studies, 
although it is also possible that the 


l avoidance 
response is reduced is because the fear 
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lying behind it has been attenuated by 
the ECS. 


Anesthetics 


The use of anesthetics along with 
ECS has produced some interesting 
information because some anesthetics 
can prevent convulsive behavior пи 
occurring even though а considerab 
“convulsive” current is used. х 

Porter and Stone (1947) had their 
animals learn a multiple-U maze an 
then subjected them to ECS. b» 
group experienced ECS while unde 
ether anesthetic, and the other Lio 
Was unanesthetized. The result 
showed that ECS had little effect *4 
retention when given under ane 
but had a great effect otherwise. НЫ 
Jernberg, and Otis (1953) found hei 
convulsions produced by carbon disu 
Phide attenuated a CER in a lw 
animal but not in an otlier anesthe a 
one. . Hunt, Jernberg, and Lawlo 
(1953) found the same effect duae 
ECS as the convulsing agent. ae 
McGinnies, and Box (1949) gave E ir 
following runway training when the? 
animals had received either an inje 
tion of neutra] saline or a nembuta i 
anesthetic, They found a pues 
decrement due to ECS when the ^ 
Jects were injected by the saline > 
not when under the anesthetic. Eac i: 
of these Studies indicates that RA doe 
not occur when the ECS is given whi 
the subject is under an anesthetic. J- 

Pearlman et al. (1961) in an pro 
lent experiment add a possible wap 
cation to the Picture by showing 01 
ап anesthetic itself can produce a 
They first bar trained animals pe 
Stable rate or responding and then £ e 
them a CER produced by a pam 
shock delivered through the reinforce 
apparatus. Either 20 seconds, 5 mines 
utes, 10 minutes, or 20 minutes oe 
the CER the animals received а gh 
dium Pentobarbital anesthetic throt 


Neur: е -— 
EURAL CONSOLIDATION AND ELECTROCONVULSIVE SHOCK 


chronically implanted catheters. The 
experimenters found that the anes- 
thetic produced RA as a function 
of the time between CER and the 
Administration of the anesthetic. In 
е experiment of {һе same 
hi, they gave pentylenetetrazol, 
tant acts as a convulsant, either 20 
"ye S, 2 hours, 4 hours, 8 hours, or 
^ ays after the CER. Here they 
а that the pentylenetetrazol was 
rs Ive in attenuating the CER even 
ies given 4 days aíter the CER. 
E ied that the sodium pento- 
with a produced RA by interfering 
the a consolidating engram but that 
det канен must have pro- 
| orgetting by some other mech- 
anism, 
ers and Quinton (1964) have 
н. BCS our knowledge of the effects 
fete in combination with anes- 
S ded Following the acquisition of 
it they administered carbon 
Hnc gas after each of several ex- 
bn They reasoned that if 
pe loxide disrupted consolidation, 
RTL ects of the extinction trials would 
Sener apparent; whatever learning ex- 
abet involved would be disrupted. 
чу» bar found that the anesthetic 
ШЫ the rate of extinction, a 
the g which they took to mean that 
anesthetic served as punishment. 


REPEATED ECSs AND LEARNING 
TRIALS 

in n" of the experiments cited so far 
ing "i ead have used repeated learn- 
of E S and repeated administrations 
have roc; As several experimenters 
have amd pointed out, these studies 
tion th ubtful relevance for consolida- 
(1954. ^ McGaugh and Hudspeth 
Peated 94 example, show that re- 
is not d CSs can bring about RA that 
ation ue to interference with consoli- 
© They maintain that only 
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studies using single ECSs are relevant 
to the consolidation process. 

When more than one learning trial 
is given, other difficulties arise. With 
repeated trials the early engrams have 
probably already consolidated by the 
time the later ones are created. For 
this reason Pearlman et al. (1961) 
have argued that only situations in 
which but one learning trial (and one 
ECS) is given are suitable to test a 
consolidation process. 

If these arguments concerning one- 
learning trial and one ECS are ac- 
cepted, it means that there are very few 
ECS experiments in the literature 
which support a consolidation notion. 
The frequently cited Duncan (1949) 
study, for example, cannot be con- 
sidered as evidence for consolidation, 
and all but one of those studies cited 
by Glickman (1961) must be dis- 
carded. The only published ECS ex- 
periments, by these criteria, that can 
even be considered as relevant to a 
consolidation process are those of Pearl- 
man et al. (1961), Heriot and Coleman 
(1962), McGaugh and Hudspeth 
(1964), and Weissman (1963). But 
even these few studies may not be rele- 
vant. Chorover and Schiller (1964), 
for example, present data which they 
interpret to show that the consolidation 
process is completed within 10 seconds 
after a learning trial. ]f they are 
correct, then this is the only ECS study 
relevant to consolidation. 

Jt seems that the pressure of the 
ECS data, much of which has been 
reviewed here, is increasingly restrict- 
ing the arena in which consolidation 
may be tested. Tt is not inconceivable 
that the possibility of such tests may 
soon vanish. The kind of evidence that 
is usually taken as most supportive of 
consolidation is that which shows RA 
to be a function of the time between 
learning and ECS. It is important to 
note that this type of gradient can be 
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found in both single- and multiple-trial 
experiments and appears even when a 
very long interval separates ECS from 
the learning trial (Brady, 1952; Wil- 
liams, 1963). 


A CONDITIONED INHIBITION 
HYPOTHESIS 


While the evidence is clear that ECS 
produces a decrement in the perform- 
ance of a previously learned response, 
it is far from clear that the data de- 
mand an explanation in terms of the 
disruption of a storage process. Inas- 
much as the problem of amnesia is, by 
definition, one of forgetting, there is a 
prima facie reason to examine the pos- 
sibility that ECS produces its effects 
by processes similar to those seen in 
other instances of forgetting. 

If we take the point of view that un- 
consciousness is an unconditional re- 
sponse produced by ECS, then the pos- 
sibility is opened that some weakened 
version of unconsciousness will become 
conditioned to whatever external stim- 
uli preceded the onset of the coma. 

OW coma, or unconsciousness, in- 
duced by any method represents a 
relatively extreme value of inhibition 
on a dimension of arousal-inhibition, 
The characteristics of Such a dimension 
are well enough described elsewhere 
that it is not necessary to elaborate 
them here. However, high values of 
inhibition are found in sleep, coma, and 
deep anesthesia. Lesser values are 
described in terms of general muscular 
relaxation, drowsiness, lowered activ- 
ity levels, and the like. 

Many workers in the field of neuro- 
psychology have followed the lead of 
Pavlov (1957) in regarding deep levels 
of inhibition as being “protective” in 
nature, i.e., they serve as a massive pro- 
tection against further stimulation and 
excitation when this would be hazard- 


ous to the organism, For example, 


Jung and Tonnies (1950) have gonn 
out the central importance of t 
mechanisms whereby the brain pre- 
vents and counteracts massive T 
chronous convulsive discharges, m 
we may here regard the coma маб. 
by ECS as an inevitable ина Б 
protective inhibition to such d 
stimulation. Further, we assume ti d 
pervasive inhibitory states of this EE 
are conditionable. At least this wou 
seem to be the case from the DUM. 
tions of Pavlov (1957) upon the m 
development of conditioned sleep i 
animals exposed to the monotony E 
restraint of the conditioning appari, 
In brief, the typical EOSRA epii 
ment appears to meet the specificati “vi 
for the induction of unconditioned pr з 
tective inhibition, and we should = 
pect that some weaker magnitude ed 
this state should become condition a 
to the surrounding environmental pee 
uli. By the same token we should dur 
pect that the power of a vage. 
environmental stimulus to elicit n 
tioned inhibition will diminish with is 
temporal distance from the onset of t Ё 
unconditioned inhibition. Та end 
words, the usual laws governing the vd 
lationship between response streng 
and CS-UCS interval will hold. " 
The conditioned inhibition that i. 
be produced by the piger im stin 
uli accompanyin the origina 
will be, жое ese somewha, 
weaker than complete coma. бепе 
muscular relaxation, lowered level 
activity, and the like should be -— 
When the animal is in the presence Ye 
the CS, Any local stimuli that де 
Particularly related to the delivery , Ve 
ECS and thus were discriminat! » 
stimuli for it Should become йш 
larly potent conditioned inhibitor 
or example, the present writers П З 
noted informally that, following En 
the rat will tend to relax and bem 
limp when the ear clips are reattac 
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for a subsequent trial—in marked con- 
trast to their usual reaction to the first 
encumbrance of ear clips. By the 
b ай, simple and rather natural- 
observation of rats returned to an 
apparatus in which they were previ- 
outy ,convulsed suggests that the 
crouching that interferes with avoid- 
ie oe is produced by muscu- 
cu AUN and is quite different 
я he tense “freezing” that is seen 

in the CER pattern. | 
M oes inhibition, like any 
їп ig ea response, should be- 
ations FIN with repeated presen- 
the UCS the CS, unaccompanied by 
b which n Thus, while the apparatus 
"s i he ECS is delivered should, 
tion н : retesting, elicit marked inhibi- 
het na should become progressively 
the E ent with repeated exposure of 
xb Ты this connection 
iios, es Adams (1963) describe the 
ete pt such extinction trials on ani- 
they м] ned to an apparatus in which 
“They ad preriausy received ECS: 
at a Pos left there for 5 minutes 
though ne on 5 successive days. Al- 
Heb n m formal record was kept of 
they ements, it was noticeable that 
this ы м тоге апа more active as 
516] х po experience continued [p- 
receivin T note further that animals 
niote iud ns extinction experience 
afe pe adily learned an active avoid- 
to thi sponse later. We shall return 

= observation a little later. 

esed the animals are behaving in a 
is furth manner in the face of the CS 
of Willi documented by the remarks 
earlier ye (1961) who confirmed the 
(1952) observations of. Hunt et al. 
“шше Williams pointed out an 
sive mae of electroconvul- 
ing ECS She noted that follow- 
in their Т the treated animals when 
in the mome cages “huddled in a ball 
горе] A corners of the cages OF 
on the food containers. They 
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were hyperirritable when handled and 
squealed, leaped or ‘froze’ when E at- 
tempted to lift them [p. 636].” This 
behavior was in marked contrast to 
their response when faced with the ex- 
perimental apparatus in which a CER 
had been established earlier: “In this 
experiment, the ECS animals behaved 
as if the grid were not emotion-arous- 
ing [p. 636].” They showed no fear, 
and exhibited the seeming paradox of 
an enhancement of emotional respon- 
siveness outside the apparatus but 
marked attenuation of it in the ap- 
paratus. 

It is entirely possible that the seem- 
ing paradox is resolvable by assuming 
that the stimuli most associated with 
ECS now elicit conditioned inhibition, 
and this state is especially antagonistic 
to high states of arousal. i.e., the states 
that prevail in the mediation of emo- 
tional behavior. The studies of Adams 
and Lewis (1962a, 1962b), already 
cited, are entirely congruent with this 
inference. When ECS was admin- 
istered at a location other than that in 
which the learned response usually 
occurred, it had less amnesic power 
than ECS given in the learning appa- 
ratus—even though the time interval 
was the same in both conditions. 
Adams and Lewis originally con- 
jectured that a partial form of the 
convulsion was conditioned to stim- 
uli associated with ECS. Thus, RA 
appeared because the animal was now 
crouching and huddling instead of per- 
forming the original response. One 
of the few data perplexing to this point 
of view was the observation that the 
animals were limp and relaxed upon 
being returned to the ECS situation; 
behavior hardly consistent with the no- 
tion of a conditioned convulsion. Ac- 
cording to the present point of view, 
the ECS results in inhibition which 
becomes conditioned to the ECS stim- 


uli, There is, then, a “learned relaxa- 
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tion,” which interferes and competes 
with the original response and pro- 
duces RA. 

Returning to the literature surveyed 
in the first part of this Paper, we note 
that the overwhelming bulk of investi- 
gations have demonstrated forgetting 
of avoidance responses, and in particu- 
lar the forgetting of avoidance re- 
sponses such as freezing that seem to 
be mediated by very high states of 
arousal Tt was pointed out that ECS 
was particularly selective for this type 
of behavior. This is по longer surpris- 
ing from an inhibition point of view as 
we should expect that precisely this 
kind of response would be most ad- 
versely affected by pronounced relaxa- 
tion—rather analogous to Wolpe's 

(1958) paradigm of "reciprocal inhibi- 
tion." The animal cannot freeze if he 
is relaxed. Responses that are medi- 
ated by somewhat lower levels of 
arousal will be less disrupted and will 
more readily reappear when the “emo- 
tional" response has been antagonized, 
However, in the absence of more com- 
prehensive investigation of the effects 
of ECS upon simple approach Te- 
Sponses, it is difficult to do more than 
speculate on this point. 
The assumption that ECS produces 
conditioned inhibition has some power 
to account for the Proactive “amnesic” 
effects that have been Cited earlier, Jf 
the new learning situation contains 
stimulus elements that were present 
when ECS was delivered (eg. ear 
clips attached to the animal, or an 
general apparatus similarity), then we 
should expect conditioned inhibition to 
interfere significantly with the 


acquisi- 
tion of a new learned response. Where 
this similarity was clearly present 


(Adams & Lewis, 1962) this deduction 
is indeed confirmed. 

In this connection we should note 
that the effects of ECS are apparently 
attenuated by prior anesthesia, This is 
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not surprising. In the first place, a 
subject, under anesthesia, is везу 
unconscious and at а high level © 
general inhibition. The м e 
passage of current should be шесе » 
and ineffective in conditioning mn а 
tion. It is quite possible to deduce t m 
if the unconsciousness induced by ie 
anesthetic is rapid, and done in the wd 
perimental situation, then it a 
have the same retrograde amnesic 
fects that ECS alone does. The ag 
man et al. (1961) study seems : 
meet these specifications and did, d 
We have seen, produce pen pem 
amnesic phenomena. The Leukel oe 
Quinton (1964) study is also Бая 
sistent with this interpretation. 7 
found that an anesthetic speeded up 
the extinction of a previously en 
response, rather than produced - 
for the extinction learning. We Éi 
Sume that it was the anesthetic-pro- 
duced inhibition that increased arn 
tion. The Hunt et al. (1952) stu rd 
points further to the combined effects 
of extinction and ECS, 


The main tenor of this argument 
may be summarized as follows: | " 
l. ECS produces massive protetti 
inhibition, a weakened version of apes 
then becomes conditioned to the me 
vironmenta] stimuli present at the tim 
coma was induced. n ill 
2. This conditioned inhibition sble 
Show itself behaviorally by measur З 
motor relaxation in the presence of 
conditioned stimuli, and this will bee 
to interfere with the puri 
any other response that was previott se 
conditioned to them. Where A 
latter responses were evoked un le 
high levels of arousal, as for examp 
in “emotional” responding, the o 
ation will be especially pronounce’ "Hon 
3. Because this conditioned inhibit! it 
is a learned pattern, like any amen o, 
will produce Proactive conflict W 
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new learning just as it produces retro- 
active interferences with old learning. 

4. Where the coma is induced in an 
€nvironment other than that in which 
the amnesic effects are to be measured, 
then these effects will be lessened. 

E The other laws of conditioning 
will apply to learned inhibition. How- 
ever, in view of the massive nature of 
the unconditioned stimulus, effective 
retention intervals may be expected to 
exceed the limits usually found in typi- 
cal laboratory-conditioning procedures. 
a =~ procedure such as anesthesia 
before ders the animal unconscious 

passage of current will prevent 
the establishment of an ECS-produced 
inhibition or its conditioning to local 
oue However, if unconsciousness 
induced rapidly by anesthetic, even 
without convulsion, then the same con- 
ted inhibition should develop to 
анан in which the anes- 
was given. 


PE none of this, per se, directly 
"m с upon the issue of engram de- 
uil ion or consolidation. Instead, it 
е to subsume a large number of 
les, many of them having some- 
си paradoxical relationships with 
ach other, under a single and more 
open rubric. It seems quite possible 
at the consolidation notion has not 
Hed ias tested at all. If engram for- 
M takes the brief span of time that 
then y defines immediate memory, 
ils most of the learning-ECS inter- 
to a by investigators are too long 
di ISrupt it. Furthermore, it seems 
tpe to regard any study in which 
trial subject has had more than one 
as appropriate to the problem, as 
re again the engram formed by the 
in trial and other early trials must 
uid be consolidated by the time the 
estigator gets to the ECS stage of 
the investigation. 
Ironically, it is the success of investi- 
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gators in showing “retrograde amnesia” 
from ECS given many hours, days, or 
months after the engram was formed 
that makes it clear that the effects of 
ECS must be attributed to something 
other than interference with consolida- 
tion. One of the present writers has, 
with Adams, offered an earlier and ten- 
tative hypothesis that the interference 
effects were produced by conditioning 
part of the convulsion pattern itself. 
However, the modification of this view 
to substitute conditioned inhibition 
seems better to fit the behavior of the 
post-ECS animal and also accounts 
more directly for the antagonism of 
ECS effects for high-arousal patterns 
such as the CER. 


CONCLUSION 


We have reviewed that portion of 
the ECS literature which has been con- 
sidered relevant to the problem of 
neural consolidation. We have seen 
that very little, if any, of this litera- 
ture provides convincing support for a 
consolidation process. This is in no 
way to deny the existence of such a 
process, for in many ways it remains 
appealing. It is simply to say that 
adequate evidence for consolidation is 
not available in the ECS literature as 
has been claimed (Glickman, 1961). 
We have further presented an inter- 
pretation of the effects of ECS in pro- 
ducing RA. This interpretation 15 
based on the concept of conditioned 
inhibition, and is not meant to я 
the possibility that other mechanisms 
are iis opéré The effects of ECS 


are undoubtedly very complicated. 
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THEORETICAL NOTES 


COMMENT ON THE EXCHANGE OF THEORETICAL NOTES 


BETWEEN SMITH AND BLACK AND LANG 


ROBERT B. MALMO 
Allan Memorial Institute, McGill University 


In an exchange of notes Black and Lang present Previously unpub- 
lished muscle-potential data indi 


curare drugs in (at least) greatly reducin; 
ments of cardiac conditioning. This weak 
of Smith who holds that ANS conditioning 
I comment on the muscle-potential data of Black and Lan 


to additional data from 


stimulation that are also interpreted as wi 


position. In conclusion, 
problem is suggested. 


A recent exchange of theoretical notes 
(Black & Lang, 1964; Smith, 19642, 
1964b) was concerned 


with the question of whether or not curare 


drugs . . . forestall Tesponses in skeletal 
muscles [Smith, 19642]. 


One of the main theoretical issues was 
one earlier expounded by Smith (1954). 
This is the general problem of whether or 
not autonomic nervous-system (ANS) 
conditioning can occur as à primary phe- 


t is invariably an 


sponse mediation of conditioning, The 
paper by Black, Carlson, and Solomon 
(1962) dealt specifically with ANS con- 
Teservations had 
mainly to do with the actual effectiveness 
of the curare drugs in reducing skeleta] 
motor reactions to such a low level that 
they could not reasonably be expected 
to be the primary mediators of the ANS 
conditioned responses. 

In Black and Lang's (1964) reply, 
muscle-potential data were presented 


а combined experimental 


g skeletal-motor accompani- 
ens the theoretical position 


with brain 
eakening Smith's theoretical 
attack on the 


showing that the curare drugs had indeed 
greatly reduced the level of backgroun 
muscle tension, In fact, at the атре 
gains used the muscle potentials «m 
to appear in the tracings. In ami _ 
(1964a) reply he stated that the muscle 
potential data were far more convincing 
than the unaided observations of o 
responses reported earlier, and he in n 
faced that he would be willing to abandon 
his earlier theoretical position if {аг 
Studies of this kind continued to provi 
data contrary to this position. a) 
Ty note will deal with two points: ( е 
а technical point concerning the relative 
adequacy of the muscle-potential Ws 
and (5) another line of attack on Smit f 
theoretical Position, different from that O 
experimentation with curare drugs. " 
Black and Lang's (1964) muscle-poten 
tial data Tepresent a real technical wt 
vance, but as they realized! their da e 
would be much more informative we! 
they to employ an amplifier with a e 
Fequency response up to 200 cycles ra 
second and with considerably more 82^" 
Such that good recording could be a 
tained at the level of 1 microvolt Р 
millimeter, Б 
. My second point, as previously po 
tioned, is concerned with a different li 
1A. 


ication 
1964.” H. Black, personal communica 
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= attack on Smith’s (1954) position: 
he aap the sensory pathways and stimu- 
pe close to the "motor area" for the 
a reaction, thus avoiding peripheral 
eS arion (e.g., electric shock) that un- 
ael y produces strong concomitant 
of ca motor reactions (in the absence 
ri drugs). A recent experiment 
Шо] то, 1963, р. 26 ff.) has demonstrated 
ne easibility of this brain-stimulation ap- 
чуно With lateral septal stimulation 
keen ae ii stimulus in a classical 
Ше her experiment, the uncondi- 
cordi neart-rate slowing reaction was 
fale pos a tone in a group of 20 
artifact ранып was ruled out as ап 
sm (Malmo, 1963, p. 31 ff.; 1964). 
cluded pog evidence it was also con- 
tions aat the heart-rate slowing reac- 
A Te probably not artifacts of 
lion AM reactions to brain stimu- 
studied though such reactions were not 
fens as systematically as respiratory 
Cds T (Malmo, 1963, 1964), it was 
oe um when movements did follow 
little a anlatan (some animals showed 
in kind no reaction), they varied widely 
subject pi in intensity from subject to 
med A ways, however, being associ- 
Bis еш 2 heart-rate slowing (for meth- 
and fol oyed see Malmo, 1963, p. 26 Ё.; 
question Published data bearing on the 
acts is Of possible skeletal motor arti- 
р, 31 ee Malmo, 1963, p. 18 ff. and 
"do the evidence from this ex- 
tlie cond appeared strongly to support 
tions Ue ш that the heart-rate reac- 
primey itioned and unconditioned, were 
skeleta] and not in any way artifacts of 
Positive et Again, the major 
the wid ped was that in contrast to 
tory am" varying patterns of respira- 
subject t Skeletal motor reactions from 
WEIS co © subject the heart-rate reactions 
ing. e M in the direction of slow- 
the Boss evidence strongly supported 
elicited clusion that septal stimulation 
piles an autonomic reaction that was a 
this т, У reaction, and furthermore that 
action was conditioned. 
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The artifact notion of Smith (1954) 
seems extremely difficult to defend in the 
presence of these data, especially when 
they are considered together with those 
of Black and Lang (1964). If at higher 
gain muscle potentials still fail to appear 
and conditioning occurs when curare 
drugs are injected, using the procedure 
of Black and Lang, the results of course 
would be quite conclusive. On the other 
hand, in case there is some slight residual 
muscle activity revealed at high gain, 
the peripheral shock stimulus could be 
replaced by central stimulation in order to 
produce heart-rate slowing with little or 
no activation of the skeletal motor sys- 
tem; and under these more favorable 
conditions, injection of a curare drug 
should completely eliminate all muscle 
potentials. 
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TNI TNA 
IN DEFENSE OF A NEW APPROACH TO OLD PHENOMEN 


NORMAN J. SLAMECKA 


University of Vermont 


The soundness of Dallett’s (1965) correction procedures as pibe te 
data from the method of association was recognized, but his conclu- 


sions about the existence of remote associa: n 
ground that they lacked statistical confirmation. 
arguments and experimental data were evaluated 
adequate. It was concluded that the doctrine of re 
still unsupported and that alternative explanations 


Evidently the doctrine of remote associ- 
ations (RAs) can still summon some sup- 
porters to its tattered banner, Dallett 
(1965) and Bugelski (1965) have risen 
to the challenge implicit in my paper 
(Slamecka, 1964) and have offered some 
interesting arguments and data in de- 
fense of the traditional doctrine. Their 
remarks, together with this reply, should 
inspire wider Scrutiny of an aspect of 
associationism which has for too long 
been taken for granted. 

Dallett confined his remarks solely to 
the treatment of data obtained by the 
method of association. He described a 
rational ad hoc procedure which corrects 
for opportunity, and also for stimulus 
and response biases in Such data (the 
latter being only an approximate correc- 
tion), and eventuates in an expected 
gradient which is to be compared to the 
actually obtained gradient of presumed 
RAs. He applied this Procedure to the 
data of my Experiment IV, which in. 
volved no serial learning, and also to 
some serial learning based data of his 
own and presented the resulting gradients, 

Presumably, if those elusive RAs are 
actually operating, a comparison of the 
two gradients would show a Systematic 
difference between them. At low degrees 
of remoteness the obtained values should 
lie well above the expected and should 
then slope steadily and progressively 
downward until they lie well below the 
expected values at the highest degrees of 
remoteness, thus generating a monotonic 
gradient whose slope clearly exceeds that 
of the expected gradient. This is a per- 


tions were disputed on the 
Bugelski’s (1965) 
and judged as in- 
mote associations 15 
are stronger. 


E i- 
fectly acceptable rationale which const 
tutes a real advance in methodology. бей 

However, when Dallett (1965) нен 
Presented the gradients from ey o o 
data and claimed that ie cna 
Figure 2 reveals that the obtained the 
gradient is in general steeper ne px 
expected one [p. 167],” I cou thing. 
longer agree. It reveals no gf that 
The difficulty with his conclusion p ctive 
it was really unsupported. No obje at 2 
test was provided by which to arriv * е 
valid comparison of the gradients. 


Е ‚ some 
visual inspection, accompanied by is а 
subjective judgmental process, alu- 


totally unacceptable method for eY The 
ating those particular curves. hout 
courses of the obtained points ане я і 
his Figure 2 show erratic revers? ono- 
every case rather than the steady. rhet, 
tonic declines to be hoped for. pon Са 
in the graphing of the background E ues 
ents the "—1" degree (whose ihould 
extend upward so extremely far) * ause 
not even have been included. Pod 
that degree represents adjacent ett 
ations and not remotes. The differ opea" 
among the remaining frequencies " h 
relatively unimpressive; and sine ë 
standard errors of such data m jil- 
relatively large, it is a very real ports 9 
ity that the curves lie within the lim 
random variation. i 

Dallett’s chief contribution to the urc 
is his sophisticated analysis of the о as- 
of bias that surround the method ine 
Sociation; but, as far as the doe ет” 
remote associations is concerned, th 
регог is stil] wearing no clothes. 


ssu? 
5 
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a (1965) critical remarks 
a n extensive that I have chosen 
ont only to those that seemed most 
tinent. These points will be taken 
up in the order in which they came in his 
paper, i 
ax ее assertion that. I assume 
Blániü a position curve "requires no ex- 
qe р is false, as is his assertion that 
tor ee the question. The "ques- 
Fi E. hand concerned the validity of 
dee ge surrounding the existence of 
М 5, апа it was far from being begged. 
Y statement (Slamecka, 1964) was, 


данене the ultimately accepted explana- 
ФАР» пе serial position curve will be (and 
Problem). ‘still being published about that 
soelanana it is unlikely that remote as- 
ions will play any part in it [p. 74]. 


Te recent evidence suggests that 
upon an a curve may be based 
tine (Eb earning of relative item posi- 
rather d enholtz, 1963a; Jensen, 1962) 
of pr than upon some complex bundle 

F esumed RAs. 
ieee the use of some rather 
tempted a arguments, Bugelski at- 
hat seri 3 shore up the old assumption 
irect i learning is a matter of forming 
Presentation associations. However, his 
with th ton is somewhat out of keeping 
Serves : newer experimental facts and 
hold of mur to illustrate the powerful 
findings ^ Ebbinghaus tradition. The 
(19635) T such students as Ebenholtz 
(1962) 3 nderwood (1963), and Young 
ulus eg it quite clear that the stim- 
Preceding item in a serial list is not the 
ceding M or even a cluster of pre- 
bu уш that serial learning 
the iter y fixing the relative positions 
era] БА was а recognition of the gen- 
that; of those findings. I quite agree 
Seventeenth | were asked to name the 
Counting | letter of the alphabet without 
also a he would probably miss it. 
re d that one would not answer with 
Come xs „е, or v, w, x, y, z, but would 
letter asonably close to the seventeenth 
It is undeniable that in serial 
one does, in fact, get to know 
it about the ordinal positions of 


Prog 


“arning 
luite а h 
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the items (Schultz, 1955), but further re- 
flection indicates that this is not the 
central aspect of the process. Thanks to 
Bugelski's clarity in putting the question, 
the answer can also be made clearer. I 
believe that it is the relative positions of 
the items that are being learned, and only 
incidentally their ordinal positions. Thus, 
the appropriate type of question to ask 
our subject would be: “What letter ap- 
pears later in the alphabet, ј огт, еог1, 
p ог t?" and the like. I predict that опе 
could answer such relational questions 
with near-perfect accuracy. No amount 
of juggling with hypothetical RAs could 
account for the feat. 

3. Apparently Bugelski is willing to 
concede that the derived list technique 
is practically useless for providing real 
proof of the validity of RAs, and I agree 
wholeheartedly. Nevertheless, the logic 
of that technique demands that a first-or- 
der list (having the strongest “remote” 
associations) be learned faster than a 
scrambled control list (with necessarily 
weaker associations on the average). 
My first three experiments showed that 
such an outcome occurs only if the list 
is a regularly patterned one and (a) 
is either derived from a familiar sequence 
such as the alphabet, or (b) the subject 
is aware of the principle of patterned 
derivation. 

Bugelski also complained that the use 
of an alphabet list tends to “confuse the 


Since the reason for using an 


issue.” 8 
Experiment I 


alphabet sequence in my 
was not well taken, one need only consult 
Experiments IT and III which made the 
same basic points with nonsense syllables ! 
These experiments provided strong sup- 
port for a perception-of-patterning ration- 
ale and gave no comfort to the remote 
associations doctrine. 

4. Bugelski had two objections to my 
experimental demonstration that the 
overall results of the method of associ- 
ation are artifacts of response biases and 
can be produced without any serial learn- 
ing whatever. First, he felt that a pre- 
familiarization frequency of 25 for the 
first item and of 1 for the fourth item 
was not a good analogue of serial 
learning, since it implies that 25 trials 
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are needed to master a six-syllable list. 
Second, he was alarmed over the finding 
of such a high proportion of backward 
associations, in spite of the fact that 
previous studies have always reported a 
substantial number of them. Even Dallett 
(1959) reported more backward-remote 
than forward-remote associations at each 
of his three retention intervals, Would 
Bugelski also consider those data “fan- 
tastic” ? 

In order to indicate that neither of the 
two objections is crucial to the issue, 
Table 1 is presented, It depicts some 
hypothetical data from the method of as- 
sociation, illustrating a perfect response 
bias from the serial position effect dur- 
ing original learning. There is a ratio of 
2 to 1 between the first-item and fourth- 
item responses. Since in my Experiment 
IV a ratio of more than 4 to 1 was ob- 
tained when the respective practice fre- 
quencies were 25 and 1, it follows (as- 
suming a logarithmic relation between 
trials and free-response Probabilities) 
that the six-item list of Figure 1 might 
be learned in about seven trials, In this 
table there are more forward than back- 
ward associations (90-72), more adjacent 
forward than adjacent backward (27-22), 
and more remote forw; 
backward across degrees ] through 4 


(58-50). Having satisfied Bugelski's re- 
quirements these data 


TABLE 1 


HYPOTHETICAL DATA OF THE METHOD of 
ASSOCIATION ILLUSTRATING 4 PERFECT 
RESPONSE Bras 


Response item 


in serial order) 
Stimulus 
items 1 2 3 4 5 6 
(X) 6 5 4 3 4 5 
1 ms 4 3 4 5 
2, б — 4 3 4 5 
3 6 5 — 3 4 5 
4 6 5 4 = 4 5 
5 б 5 4 3 — 5 
6 6 5 4 3 4 = 


Note.—The number in each сеј 
which that response was made t 
during the association test, 


is the frequency with 
9 the given stímulus 


THEORETICAL Nores 


s 
will still show the usual pattern bid € 
as a function of degree of remoteness E 
thus indicate that the objections wem E 
really important a. criticism О: 
response-bias rationale. . durs 

3. Bugelski felt there was an inconsist- 
ency in my position that (a) “artifacts 
method of association RAs are Сен 
of response overlearning (due to D) with 
tial practice frequencies), and (0 yera” 
the method of anticipatory and pera item 
tive errors they are anias o о 
underlearning (insufficient learn! is su- 
item positions), This oberta OR с 
perficial since it overlooks the rea 
aspects of the two situations. > 

With the method of association E p^ 
ject is free to emit any list respons m d 
therefore the determining ww ^ 
Prior frequency is allowed fu an 
With the method of апер ке 
perseverative errors the subject is a are 
in serial learning, and his ec ^ irf- 
necessarily more constrained. P list, 
ing to anticipate the next item in " 

SO he refrains from uttering 058 позе 
have already passed and those a we 
identity and positions are | ap a 
enough to be recognized as ше ро5ї- 
that point. The influence of ES nar- 
tion is evident in the ne menie d in- 
rowing range of the list from W ats 
trusion errors come. Such constrai ndi 
Not exist in the former, free-resp^? hoc 
Situation, These explanations ge oe d at 
and properly so, since they are 2! testing 
two quite different and specific 
conditions, 

6. Now for consideration of d 
trial experiment which suppose tho 
butted my entire position on the Sor with 
anticipatory and perseverative wesented Я 
one fell swoop, Bugelski ag ne 
Serial list to classroom groups ted the 
*xposure only and then presen e that 
third word and asked for the OM sere 
followed it. Half of the peg t 
correct, and the remainder were Я 


one- 


c ; eque 
uted in Progressively decreasing p ес 
cies as their distance from the that 


Word increased, Bugelski аста, of ir 
would predict a random dissi as col 
trusions, so he viewed the гези 


art 
n 
trary to my position, The seco 
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of the study was an unaided serial recall 
test of the entire list. Bugelski assumed 
that I would predict a negative correla- 
tion between the number of items cor- 
теу learned as to position, and the 
p to which they occurred as errors 
iine first part of the study. The ob- 
à correlation was positive, so 1t was 
again viewed as contradicting my posi- 
tion, Г 
T ieee data do not constitute a rebuttal 
the З ша. Bugelski's rationale for 
cross u y was evidently based upon a 
the o versimplifoation of my account of 
е ната of serial learning. It must be 
lated: Dered that two stages were postu- 
host а response-learning phase and a 
SI ton-learning phase, I also said 
lamecka, 1964), 


the n н 
as subject + +. Occasionally makes a guess 
hap О the next item, uttering any one he 
those р to know [italics added] except 
whose EE has just seen... and those... 
identity a iti 5 : 
Ip. 72], y and position have been fixed 


ыр tee e offer a response which he has 
column g ien, A glance at the last 
E 9 Bugelski s Table 2 shows that 
early jeomseilearning phase favored the 
таб. fe and that there was a syste- 
ist Жо; in item availability аз the 
Went me Items 7 through 10 
Subjects eported by more than half of the 
equally obviously they were not all 
с ri vailable as intrusions ! The use 
Mate the words as items does not elim- 
Cisterc os Tose learning phase from 
Ind ion at all. 
tainly ар conditions І would сег- 
of errors prede a random distribution 
Very disp would instead expect the 
namely ribution „that was obtained, 
esiones e that mirrored the pattern of 
items, w availabilities. The first two 
both y vnose identities and positions were 
intrusion learned, were rarely given as 
tions, E again according to expecta- 
“qually n if all of the items were 
rro; kenn s I would predict a random- 
No Dositi = ution only if there had been 
The mo; ton learning accomplished at all. 
ШЕН ment that position learning showed 
> "ere would be a restriction of the 
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range of intrusions according to the serial 
position function. 

This same differential response learn- 
ing renders Bugelski’s correlation data 
confounded with respect to its interpre- 
tation and makes his subsequent conclu- 
sions untenable. The only condition un- 
der which a negative correlation between 
intrusion frequency and degree of posi- 
tion learning should be found would be 
when all the items were equally available. 
The correlation would simply reflect the 
progress of the position-learning phase, 
and its magnitude should increase as the 
degree of position learning increased. 

An appropriate method of testing my 
position within the framework of Bugel- 
ski's own study would be to provide the 
subject with a scrambled listing of the 
items for reference during the recall tests. 
This would guarantee equal item avail- 
ability and would allow the results to re- 
flect position learning unconfounded by 
differential response learning. With 
such a procedure a curvilinear distribu- 
tion of errors should result, as well as 
the above-mentioned negative correlation. 

After having evaluated the remarks of 
these critics, I am more than ever con- 
vinced that there is less than meets the 
eye in the doctrine of RAs, and that the 
tide is surely ebbing for the house of 


Ebbinghaus. 
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SPATIAL VARIABLES, OBSERVING RESPONSES, 
AND DISCRIMINATION LEARNING SETS" 


FRED STO 


LLNITZ 


Brown University 


An observing response i 
discriminative stimulus. 
of occurrence of an observing 
| area, and (b) inversely with sp: 
response, With these assumptions, 
the effects of these 2 spatial varial 
monkeys, chimpanzees, and children. 


respo 
at 
ol 


variables, but, surprisingly, the 
learning-set training. 
few trials in learning- 


discrimination problem can overcome 
difficulty persists through extensive 


It is concluded 
set training results in extinction of 


s any response that results in exposure to a 
2 assumptions are made: that the probability 


nse varies (a) directly with cue 
jal separation between cue and 
bserving-response theory explains 


bles on discrimination learning by 


Prolonged training on a single 
the difficulty produced by these 


that changing problems every 
any observing 


response that might be reinforced within problems. 


Tics cde rhesus monkey is trained 
ctiminatio. trials on each of many dis- 
criminati n problems, it forms a dis- 
1949 mes learning Set (Harlow, 
gra ЖЕ )- That is, its performance 
брза. changes from practically no 
tion on each problem to rapid so- 

of each new problem. How 


rapid? Е 
? One-trial discrimination of 
1B 
teed 5 
(Stol Sed in part on a PhD dissertation 


fusi D 1963a) written while I was a Na- 
y Saas Foundation Graduate Fellow. 
рег y riments and the preparation of this 

: с» supported by United States 
8 ealth Service Research Grants 
t and MH-07136 from the National 
chrier, who di Health. Allan M. 
inc x о directed the dissertation, gave 
теч, helpful criticism, and kind 
nderson A Comments by Kenneth V. 
тоок, С onald S. Blough, Judith L. 
мола R Mies S. Harris, Judith R. Harris, 
М. Stearns. Meyer, V. J. Polidora, Elizabeth 
Impr, ns, and L. Benjamin Wyckoff have 
ved this paper. 
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three-dimensional objects is well 
known. In fact, learning-set forma- 
tion has been described as “converting 
a problem which is initially difficult for 
a subject into a problem which is so 
simple as to be immediately solvable 
[Harlow, 1949, p. 56]." This view 
was supported by the results of. Har- 
low and Warren (1952), who found 
that rhesus monkeys (Macaca mu- 
latta) could form a learning set when 
trained with 3-inch squares cut from 
the pages of magazines. The initial 
great difficulty of these discriminations 
was gradually overcome, and eventu- 
ally many problems were solved in 


single trials. 

So it was quite reasonable for Bla- 
zek and Harlow (1955) to expect that 
performance differences between ini- 
tially easy and initially difficult prob- 
lems would gradually disappear during 


learning-set formation. They trained 
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Fic. 1. Four spatial variants of a color- 
discrimination problem, (A, whole-card 
center cue; B, whole-card border сне; С, 
part-card center cue; D, part-card border 
cue. The colored areas—shaded—cover 25% 
of the base card in A and C, and 75% in 
B and D. C and D are shown with a food 
well uncovered after a response.) 


rhesus monkeys to discriminate be- 
tween differently colored Squares cen- 
tered on 3-inch-square white cards 
(Figure 1, Part A). Problem diffi- 
culty was varied by using colored 
Squares of different sizes: The colored 
Squares covered 100%, 50%, 25%, or 
12.5% of the 3-inch-square bases, with 
both stimulus cards of a given prob- 
lem having the same size colored 
squares, ts surprised even 
As expected, per- 


formed a learning set, 
keys never learned to d 
the small colored Squares as with the 
large ones, Even after 768 six-trial 
problems, performance levels remained 
directly related to the size of the col- 
ored squares. The differences among 
performance levels increased at the 
start of the experiment but were con- 
stant over the last 576 problems, 
From these results, Blazek and Har- 
low (1955) concluded that mor 
ing could not be expected to 
the differences in performance. 
In a similar experiment, Schrier 
and Harlow (1956) trained Java mon- 
keys (Macaca irus) on 432 10-trial 
problems with colored squares cover- 


O as well with 


e train- 
reduce 


ing 100%, 38%, or 25% of ау. 
square white cards. Раана H 
25% and 38% problems were eps 
but persistently worse (by pocas 
correct responses) than perform 

on 100% problems. " 

The | cea of problems like i 
shown in Figure 1, Part A, was К 
covered 2 decades ago age 
1945), but the variables that п " M 
them difficult have been identified oe 
recently. This paper will show nces 
the persistent performance баена 
found by Blazek and Harlow C as 
and Schrier and Harlow (195 uns 
well as other effects of the same ав 
bles, сап be explained by an oe рош 
tion of Wyckoff’s (1952) theory 15- 
the role of observing responses iid be 
crimination learning. It shou s not 
noted that the present paper ^ rma- 
attempt to explain learning-set стаі 
tion, but only to show why wn 
results have been obtained in а Con- 
ber of learning-set experiments. xperi- 
ceptual developments followed e the 
mental discoveries, of course, a! to 
Teverse path will be followed he toric 
avoid some dead ends, The ere 
byways have been traced by 
Treichler, and Meyer (1965). 


EORY 
OssznviNc-ResPoNsE THEO 


b- 
à he 0 
Wyckoff (1952) introduced conve 3 
Serving-response concept as à ding 
ient way to deal with atten rimina 
discriminative stimuli in ea п ol 
tion learning experiments. here as 
serving response is defined ppost 
апу response that results in e7 «Ех 
10 а discriminative stimulus. pica" 
1! 

А 3 app 
* This definition is more vaden i in CE 
ble than "any response which rest as im 
posure to the pair of aet Кр wi 3 
Wrolved [Wyckoff, 1952, p. 43 ` respons 
Wyckofr's definition, an OSEE Seri p 
in a Successive-discrimination а stimuli 
would have to last for two or Mefnition a 
presentations, Moreover, his q simult 
inappropriate for at least som 
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OBSERVING RESPONSES 


posure” implies that the discriminative 
stimulus is exteroceptive; thus, ob- 
Serving. responses can be distinguished 
from’ other chained responses that 
ne only proprioceptive stimuli 
repu x in press). But an observing 
iig ta th is still a link in a chain lead- 
esti € response on which reinforce- 
is based (the effective response). 
йон and extinction of ob- 
itis ^d responses are determined by 
аван variables | that determine 
chained а and extinction of other 
esponses. 
d m of this paper, "response" 
servin ective response’ unless ob- 
Шеш Ө се is specified. “Cue 
кою iscriminative stimulus,” as its 
Wyck R Кү suggests. Following 
Probabilit (1952), p, represents the 
ing reg y of occurrence of an observ- 
ponse. 
ET kd Subject is not exposed to a 
ously теш responding is obvi- 
пель ше Therefore, every 
ing res кеш task requires an observ- 
even сие for correct performance, 
ply Sindis observing response is sim- 
Буып " with the eyes open. _Para- 
бый the situations in which an 
oe am Fisco. is most likely to 
Osee е in which its necessity 
Wyekote Cs to be ignored. Even 
тар when } 52, p. 431) fell into this 
no Observi 1e wrote of Situations where 
that is to a response is required of S, 
is ER . . . Situations in which 
timinative to be exposed to the dis- 
jor + stimuli on each trial or 
© each effective response (Po = 


Neous-die. ss 
For igiscrimination experiments, as well. 
Serve oni nee, retarded children often ob- 
"d dus of a pair of objects on each 
i Fees hey have learned the discrimina- 
Шу pcan & Shepp, 1964). Similarly, 
Sd monkeys (Macaca memes- 
t AI simultaneous-discrimination 
Jeg n in one trial, observe only the ob- 
(Lockhart they push aside on that trial 
» Parks, & Davenport, 1963). 


Ос 
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1).” When р, = 1, an observing re- 
sponse is just as necessary for correct 
performance as when f, is very low. 
The difference is that when ро = 1 at 
the beginning of training, no observing 
response must be acquired by the sub- 
ject, but when ро is initially very low, 
an observing response must be acquired 
before the subject can perform cor- 
rectly. The present paper is mainly 
concerned with situations in which ро 
is very low at the beginning of training. 

The following mechanism for acqui- 
sition of an observing response was 
proposed by Wyckoff (1952): "Ex- 
posure to discriminative stimuli will 
have a reinforcing effect on the observ- 
ing response to the extent that $ has 
learned to respond differently to the 
two discriminative stimuli [p. 435]." 
Later, Zeigler and Wyckoff (1961) 
found that either learned or unlearned 
differential responses to discriminative 
stimuli can reinforce an observing re- 
sponse, Thus, a discriminative stimu- 
lus does not have to be paired with 
reinforcement in order to be a re- 
inforcer (cf. Estes, 1958; Guthrie, 
1959; Premack, 1959; Wyckoff, 
1954). The “cue strength” (Wyckoff, 
1959), “signal value” (Berlyne, 1960, 
p. 87), or probability of response to 
the stimulus (Premack, 1959) deter- 
mines the reinforcing effect on the ob- 
serving response. L 

In the visual-discrimination experi- 
ments to be discussed, appropriate 
fixation and accommodation are as- 


sumed to be necessary for exposure 


to a cue? So a chain of observing 


3 They may also be sufficient for situa- 
tions in which the same visual dimensions 
are always correlated with reward and non- 
reward, as is true of the experiments to be 
discussed. To handle other situations, Zea- 
man and House ( 1963) have postulated that 
particular dimensions are covertly observed 
or attended to. Similarly, attending to non- 
visual cues sometimes requires covert ob- 


serving responses. 
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responses may include locomotion, 
head movements, eye movements, and 
accommodation movements of the cili- 
ary muscles. For practical purposes, 
accommodation is a covert response. 
Eye movements may be measured more 
directly (e.g., White & Plum, 1964), 
but this is not convenient when a 
monkey is running around a cage. So 
experiments may require arbitrary, 
easily measured observing responses 
to be added to the chain (Kelleher, 
1958; Wyckoff, 1952; Zeigler . & 
Wyckoff, 1961). Because each link 
in the chain is reinforced by the fol- 
lowing link, all observing responses in 
the chain are affected similarly by 
variables that affect the probability of 
visually fixating a cue. Therefore, 
knowledge about arbitrary, experi- 
menter-defined observing responses 
can be used to analyze situations in 


which the Observing response is not 
measured, 


SPATIAL DETERMINANTS Or OBSERV- 
ING-RESPONSE PROBABILITY 


Separation between Cue and Response 


Monkeys look wher 


e they put their 
fingers. Color- 


discrimination learning 
by macaque monkeys (Macaca mulatta 
and M. irus) is impaired when the 
cues are placed only 0.50 to 0.75 inch 
away from where the monkeys touch 
the display in making their choices 
(Schuck, 1960; Schuck, Polidora, Mc- 
Connell, & Meyer, 1961). To account 
for this influence of Spatial separation, 
Schuck (1960) Proposed the existence 
of “a gradient of visual sampling, with 
a maximum in the region of the mon- 
key's hand position in manipulation 
[p. 255]." 
The sampling gradient is interpreted 
here as a spatial distribution of фо for 
cues at various distances from the re- 
sponse locus. The quantitative prop- 
erties of this distribution are not yet 
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known. Discovering them would ae 
ably be made easier if MEUM d 
tween cue and response egi 
in terms of visual angle and distat 
in depth. m 
A pum properties of the ашыр 
are known, however. For A 
there is evidence that the distribu e 
is not identical for all br oca IN 
cue-response separation. pi ue 
in depth of about 1 inch do not anes 
performance (Peterson & Rum pro- 
1963), presumably because the ay i 
priate change in accommodatio me 
much more probable than an pos 
priate change in fixation for a t even 
separation of the same size. . Bu lane 
when cue and response lie in "ie d 
practically perpendicular to the I! onse 
sight, the effects of equal cue Tes. 
separations in different wv ЕГ, 
differ (Horel, Schuck, & ^s more 
1961). Moreover, when there 15 imul- 
than one response locus (as a БР 
taneous discrimination), the dis dif- 
tions around the several loci am 
ier. But, for a given response is as" 
and direction of separation, it pear 
sumed that as the separation incre? P 
Ро decreases. The farther the ' 
from the response locus, the less 
the monkey is to look at it. 


: he 
Cue is at the response locus, t 
is maximal. 


Cue Area 


d 
c an 
In the problems used by Blazer, 
Harlow (1955) and Schrier pem сей” 
low (1956), decreased area of t | with 
tered color cues was confounded, and 
increased separation between p? or" 
Tesponse (assuming response on 
ders of cards—see next section): s 
the other hand, increased a кй n 
confounded with increased = үе al 
in the experiment by Schuck “i ch 
(1961). Because a 4.75-sqUa"^. ine 
increase in area failed to elimina 
effects of introducing a 0.50- to 


n Ро 


OBSERVING RESPONSES 


inel cue-response separation, Schuck et 
oe (1961) concluded that area has 
negligible” effects. But this conclu- 
iig Should be justified only if perform- 
а were unrelated to area with con- 

nt cuc-response separation. Ex- 
ie holding separation constant 
mies e Lynn, 1961; Warren, 1953; 
е up 1961, 1962; Zimmer- 
i ^ orrey, 1965), described be- 
eil itia ve that cue arca does af- 
шо or discrimination by rhesus 
п Therefore, for a given cue- 
Sher E separation and at least for 
оге En n is assumed here that po 
Fi 2 the cue area increases. 
ends E o ably the visual angle sub- 
if the ES most important. Obviously, 
А ьт X so small that a monkey 
just bere varely detect it, then ро 15 
Убу 2s y greater than zero: Only a 
егш ecise fixation would be an ob- 
large E bg sie If the area is very 
js likely T change of direction of gaze 

y to "hit" the cue. 


LrAnNING-SET STUDIES 


is A m y learning-set procedure 
са аг іп ехрегітепіѕ оп dis- 
Of the тош arning Бу monkeys, most 
Я oe evidence relevant to 
Tom ex Yang -hespanse theory comes 
Many = = eg in which each of 
Constant blems was presented for a 
ally 6 to L7 number of trials (usu- 
Single groy ). In each experiment, a 
With the n Ei monkeys was trained 
тецеу c ЕЕ spatial variants (treat- 
8toups ar ubjects design), unless two 
uU the oe 

üratus isconsin General Test Ap- 
1 р » monkeys uncover food wells 
ie ee aside cards, plaques, or 
Ces gr a, When the entire sur- 
Color. fae such objects differ in 
touch a example, the monkey must 
he oh; K colored surface in pushing 
tiguous е Cue and response are con- 
^ Po is high, and the color dis- 
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crimination is learned rapidly. Now 
consider a pair of cards or plaques 
with only their centers colored differ- 
ently (Figure 1, Part A). If the total 
area of the cards remains the same 
(typically 9 square inches), the cue 
area is smaller. In addition, assuming 
that monkeys usually touch only the 
white borders in pushing these cards, 
cue and response are spatially sepa- 
rated. Decreased area and increased 
separation have the same effect: po is 
lower than before, and discrimination 
learning is retarded. This difference 
has been found repeatedly (Blazek & 
Harlow, 1955; Harlow, 1945; Schrier 
& Harlow, 1956, 1957; Warren, 
1953). Most of these studies have 
varied the size of the center cues and 
obtained performance levels that vary 
directly with size, as would be pre- 
dicted both from the changes in area 
and from the changes in separation. 
The two spatial variables are con- 
founded in these experiments, and ad- 
ditional evidence is needed to show 
that both cue area and cue-response 
separation in fact influence learning- 


set performance. 


Cue Area 

Still assuming that monkeys touch 
the borders of cards that they push 
from food wells, spatial separation be- 
tween cue and response becomes zero 
if the centers of the cards are white 
and the color cues are on the borders 
(Figure 1, Part B). Nevertheless, 
Warren ( 1953) found that decreasing 
the cue area (decreasing po) impaired 
the performance of rhesus monkeys on 
such border-cue problems. 

The influence of cue area has been 
confirmed by an experiment in which 
the place where the monkeys touched 
the display was known rather than as- 
sumed. Leary Lynn (1961) 
modified a test tray to hold stimulus 
cards over the food wells in such a way 


and 
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that rhesus monkeys could move the 
cards only by pushing on the pe 
edges. Colored 1-їпсһ squares an 
lx 3-inch rectangles were placed = 
equal distances from the near edges o 
the cards. With cue-response sepa- 
rations thus held constant, perform- 
ance was better with the larger cue 
areas than with the smaller ones, sup- 
porting the present analysis, 


Separation between Cue and Response 


Macaque monkeys perform better 
on border-cue problems (Figure 1, 
Part B) than on center-cue problems 
(Figure 1, Part A) when the cue area 
is the same (Riopelle, Wunderlich, & 
Francisco, 1958; Warren, 1953). This 
shows that small cue area is not the 
only cause of difficulty in center-cue 
problems, As mentioned above, cue- 
Tesponse separation js Presumably zero 
in border-cue Problems, so Ро should 
be greater for border cues than for 
center cues of equal area, 

Two experiments in which rhesus 
monkeys had to Push on the near 
edges of stimulus cards similarly indi- 
Cate the importance of Сие-геѕропѕе 
Separation (Leary & Lynn, 1961; 
Schuck, 1960). In both Studies, col- 
ored regions at the near edges were 
discriminated more accurately than 
colored Tegions of the Same area at the 
far edges, Cue-response Separation 
was clearly Tesponsible: Cue Position 
had no effect when Tesponse locus was 
uncontrolled (Lynn & Leary, 1962) 
or when monkeys touched both the 
near and far edges of the cards 
(Schuck, 1960). 

In the studies described so far, the 
monkeys displaced the Whole card that 
covered the food well, But Schrier 
and Harlow (1957) cleverly Presented 
center-cue problems to one group of 
rhesus monkeys as “whole-card” prob- 
lems (Figure 1, Part A) and to an- 
other group as “part-card” problems 
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(Figure 1, Part C), ‘Before the melt 
key responds, these variants loo in 
tical. But in part-card problems, udi 
base cards are fastened to А a 
tray, and the monkey puses ч the 
only the center square to uncov € 
food well. Performance was p n on 
part-card center-cue problems tha’ bee 
whole-card center-cue problema Т 
cause the monkeys touched the co But 
portions in part-card дыы 
when the part-card group was e 
to whole-card problems, they Mem. 
to perform well, presumably tare 
they continued to touch the er 
portions of the cards. peer 
fer of touching the cues faci in a 
form discrimination by prm 
learning-set experiment by 
and Fletcher (1964), rform- 
Schuck (1960) found that ae re- 
ance of rhesus monkeys was i bor- 
liably different on center-cue Sart-card 
der-cue variants given as jm d D) 
problems (Figure 1, Parts C D 
In pushing the center Eue БОШ 
monkeys had their hands nea wa 
border and center regions, 50 ur 
high for both border and cente xpe it 
To summarize 8 years of f colo" 
mentation with the four kinds p^ Fig- 
discrimination problem shown e pro Р 
ure 1: The whole-card сеш con- 
lem (Figure 1, Part A) por the 
Sistently been more difficult E^ jg the 
others for macaque monkeys. wot! 
Only one of the four in which Bas pro 
be Substantially lowered if, as aratio 
Posed here, Cue-response cy 4A 
Were an important iie фе д dis 
Tesponse to the border O porde" 
play results in higher f» v и 
Cues than for center cues, shot! 
Sponse to the center of a display es that 
result in higher 5, for center С e ma 
for border cues. This prediction. ап 
by Schuck et al. (1961), aque mon 
somely verified. When macad on 
eys responded by touching 
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centered on the stimulus screens of an 
тайв apparatus, they performed 
LM on center-cue problems than on 
Moe problems. The difference 
e rmance on the two spatial vari- 
eg iei during the experiment 
10% m was roughly constant (about 
"e rrect responses) over the last 
of 832 six-trial problems. 
ы Persistence of performance dif- 
m S with differences іп cue- 
сеи Separation was puzzling. It 
"ай Strange that an appropriate 
üited Е response could not Бе ас- 
E е Overcome е effects of a 
to on = Separation of only 0.50 
уя id vh h, as in the border-cue 
effect: ^ of Schuck et al. (1961). The 
fies oon much larger separations have 
whieh wee in other experiments, to 
€ now turn. 


S INGLE-PROBLEM STUDIES 


à ce a sate view of ро has been 
of Seti Ө account for the effects 
studies А variables іп learning-set 
through ae be assumed to remain 
tial valu LM experiment at the ini- 
Variant t etermined for each spatial 
Separation cue area and cue-response 
results “a _But, in contrast to the 
ing, m extensive learning-set train- 
Proben, ensive training on a single 
асан ig overcome the difficulty 
arations 5 relatively large spatial sep- 
er эм ween cue and response. In 
те not : 5, observing responses that 
ormation cquired during learning-set 
Оп sing] are acquired during training 
S'e problems, Thus, in this sec- 
Ae variables can be assumed 
raining ine f, only at the start of 
таці" and changes in f, during 
Жз Must be considered. 
Problem Portance of cue area in single- 
Strateq Performance has been demon- 
Thesus n experiments with infant 
1962. Monkeys (Zimmermann, 1961, 
' ?mmermann & Torrey, 1965). 
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In these experiments, the monkeys 
responded to nursing-bottle nipples 
that were always centered in the cue 
surface, so cue-response separation was 
constant. Color discriminations were 
learned much more rapidly with large 
cues (high fo throughout) than with 
small cues (low initial р,). Also, after 
a color discrimination had been learned 
with a large cue area, reducing the cue 
area (reducing po) impaired perform- 
ance. 

The rate of learning a single dis- 
crimination is inversely related to the 
distance between cue and response 
loci, as  observing-response theory 
would predict. This is true for chim- 
panzees (Gellermann, 1933a; Jenkins, 
1943) and children (Jeffrey & Cohen, 
1964; Lipsitt, 1961; Murphy & Miller, 
1959) as well as for rhesus monkeys 
(Carr & Brown, 1960; McClearn & 
Harlow, 1954; Meyer, Polidora, & 
McConnell, 1961; Miller & Murphy, 
1964; Murphy & Miller, 1958). In 
Murphy and Miller's (1958) study, 
monkeys learned a black-white dis- 
crimination within 400 trials when 
they pushed aside identical objects 
just below the cue objects, but failed 
to learn within 400 trials when the 
cue-response separation was 7 inches. 
This result was confirmed by Stollnitz 
and Schrier (1962). But, after vari- 
ous amounts of additional training, 
some of Stollnitz and Schrier's rhesus 
monkeys suddenly learned the dis- 
crimination with the 7-inch separation. 
This sudden learning indicates that, 
for at least some monkeys, the initial 
value of f, was greater than zero 
with a 7-inch cue-response separation. 
Wyckoff (1952) described a positive- 
feedback relation between discrimina- 
tion and observing-response learning 
that explains the sudden learning, as 
follows: Discrimination learning was 
retarded for quite a long time because 
Po Was SO low at first. But learning, 
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once started, was accelerated positively 
because every increase in differential 
responding resulted in greater rein- 
forcement of later observing responses. 
A similar explanation can be given 
for learning by some of the children 
trained by Murphy and Miller (1959) 
with a 6-inch cue-response separation. 
In several of these studies, some sub- 
jects failed to learn at all; presumably 
these subjects’ initial pọ was extremely 
low (perhaps zero). 

Rhesus monkeys trained by Stoll- 
nitz and Schrier (1962) with small 
cue-response separations were able to 
transfer their discrimination to succes- 
sively greater separations (see also 
Schrier, Stollnitz, & Green, 1963). 
Increasing the separation 4 inches or 
more in one jump seriously impaired 
performance, but even with an 18- 
inch separation some of these monkeys 
(and some Previously trained only 
with the 7-inch separation) made 95% 
to 98% correct Tesponses. These re- 
sults extend those of McClearn and 
Harlow (1954), whose monkeys 
reached over 90% correct responses 
with 0-, 1-, 2-, and 4-inch separations 
presented repeatedly. In both experi- 
ments, the discrimination learned with 
one separation could be transferred to 
larger separations, One of Murphy 
and Miller's (1958) monkeys also 
Showed some transfer from the 0- to 
the 7-inch separation when it con- 


tinued to touch the positive cue before 
making the effectiy 


е response. Ac. 
cording to the present theory, each 
time the separation is increased, р, de- 


creases, But even though an obsery- 
ing response is at first less likely to 
occur, every time one occurs it is re- 
inforced, hecause the cues still control 
the differential responding that was 
learned with smaller separations, So 
Р, increases faster when a discrimina- 
tion is transferred to a larger separa- 
tion than it would if the discrimination 
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d at 
were being newly learned with th 


Separation. ; 
"he would be expected, pups 
small separations between cue ied 
sponse impair learning less In fact, 
than do large separations. - both 
when the same problem is given na 
with no separation and ot the 
separations to the same S ae 
effects of spatial variables may 2 E 
small, because the initial depres” ‘ne 
Po is counteracted so rapidly Y chis 
transfer effect just described. riment 
apparently occurred in an ED enm 
by Meyer (1958), who trainec rimina- 
monkeys on a single color-disc ed оп 
tion problem with cues n ud cue 
3-inch-square bases. With о inches 
areas ranging from 9.0 square 
(100%) to 02 square inch (P, 
presented repeatedly, transfer ane 
100% variant to the four cen differ” 
variants made the performance j that 
ences small. Meyer concludes, ian 
“reducing the area does not E 
tially increase the difficulty nalysi 
problem [1958, p. 183].” But pend 
of variance? (sessions X Hai of the 
subjects) showed that the effec ratio" 
spatial variables (area and ay 
combined) was reliable (P <- pe 
The combined effect of cue аге 
cue-response separation was experi" 
from the transfer effect in an R P 
ment? by Stollnitz (1963а). 
monkeys were trained on а 
whole-card color-discriminatio’ Ww 
lem, using 3-inch-square plaq" 400% 
their entire surfaces colored ag, бё 
variant) or with colored MM 
tered on white backgrounds er 06 
variant). To eliminate ert re^ 
tween the variants, they We ; 
ide 
4 Donald R, Meyer kindly et Re- 
raw data. ra 
5 Попе while Т was а .Predon T. 
Search Fellow of the Nationa n i 


lic 
Mental Health, United States Pub 
Service, 
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sented to two different groups of 
monkeys. Practically errorless per- 
formance was reached after about 100 
trials with the 100% variant and after 
about 200 trials with the 12.5% vari- 
- Meanwhile, a third group was 
rained with the two variants alter- 
nating every 5 trials. After a total of 
ig: 100 trials, the alternating group 
en een making practically no errors 
tl a 00% variant, even though this 
aining included only about 50 trials 
ш the 100% variant. (The other 
Similar with the 12.5% variant.) 
ech ү: y, after about 100 trials with 
f variant, the alternating group 
akin as accurately on the 12.5% 
Sent oe did the 12.5 group after 
ian 00 trials with that variant 
fer oh So, although the combined ef- 
alii area and separation was still 
ies s 2 there was perfect transfer be- 
ges the two spatial variants in terms 
SA io of training required for 

55 performance. 
v with perfect transfer and con- 
{ее cue area, small separations be- 
fpes cue and response reliably retard 
Боп Ing of a single color discrimina" 
ie. Stollnitz (1963a, 1963b) trained 
“йр monkeys to discriminate the 
3.0 4 E narrow strips (0.2 inch X 
ена hes) on 3-inch-square white 
strai ht that could be pushed only 
ton E back.  Cue-response separa- 
to "rites varied at random from trial 
one of T placing the colored strip at 
d aen distances from the near edge 
errors ' plaque. The total number of 
tance WR linearly related to the dis- 
ГУ е strip from the near edge 
ormance » although errorless per- 
was eventually achieved with 


all 15 a; 

erim Distances. Apparently the dis- 
Small ation was learned first with the 
idly „o Parations and transferred rap- 


to > 
Serving the larger separations as ob- 
8 responses were acquired for 
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cues at greater distances from the re- 
sponse locus. 

The results of all these experiments 
would lead one to expect that a 0.5-inch 
cue-response separation would reliably 
retard color-discrimination learning by 
rhesus monkeys, although the effect 
might be small So Polidora and 
Fletcher (1964), like Meyer (1958), 
may have overlooked such an effect 
when they reported that the median 
numbers of days to criterion with 
and without a 0.5-inch separation were 
within 1 day of each other. They 
gave 64 trials a day until 12 consecu- 
tive responses were correct within 1 
day's session, so the median numbers 
of trials to criterion may, in fact, have 
differed by as much as 116 (64 + 64 
— 12). 

With or without transfer, then, spa- 
tial variables that reduce ро increase 
the difficulty of single discrimination 
problems. But long training on a 
single problem can overcome the diffi- 
culty, even leading to errorless per- 
formance on a whole-card center-cue 
problem (Stollnitz, 1963a). 


PERSISTENCE OF EFFECTS IN LEARN- 
iNG-SET FORMATION 


Why is it that prolonged learning- 
set training does not eventually reduce 
or eliminate the differences in diffi- 
culty caused by spatial variables? 
Blazek and Harlow (1955) offered no 
answer to this question. Neither did 
Schuck et al. (1961), although they 
saw that a satisfactory explanation 
must allow for the fact that the differ- 
ences can be eliminated by prolonged 
training on 4 single problem. They 
suggested that 
nses in these situ- 


trol of observing respor 


cor 4 
is by some visuomotor process that 


ations ) p 
is highly resistant to modification. through 
generalized, interproblem learning of the 


kind involved in learning sets. But changes 
in these processes сап and do accompany the 
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learning involved in the mastery of a single 
problem [p. 545]. 


This statement merely pushes the 
question into the realm of some 18005 
motor process.” Why does this pro- 
cess” resist change during learning-set 
formation and not during single-prob- 
lem learning? To put the question 
more directly, why does p, remain at 
or near its initial value throughout a 
learning-set experiment? 
The easiest answer would be that Po 
never changes from its initial value 
within any problem, and so necessarily 
remains constant (for a given spatial 
variant) throughout the experiment. 
This answer is most obviously true if 
Po is zero. If the cues are never ob- 
served, observing can never be rein- 
forced, so р, never Changes. But even 
if ро is greater than zero, at least two 
observing responses must be made be- 
fore р, could increase. The second 
observing response could be reinforced 
only if the subject had learned to re- 
spond differentially when it made the 
first observing response, Even assum- 
ing one-trial discrimination learning, 
then, any very low initial value of Po 
may result in a negligible chance for 
Ро to increase within a Short problem. 
In fact, with very short problems and 
very low po, there would be little 
chance even for discrimination learn- 
ing. Such an experiment was done 
by Murphy and Miller (1955), who 
trained rhesus monkeys on three-trial 
object-quality discrimination problems, 
With a 6-inch cue-response Separation, 
one group showed no sign of learning 
during 672 problems. The other 
group, responding by pushing aside the 
stimulus objects, formed a learning set 
but fell to chance performance when 
switched to the 6-inch Separation after 
480 problems. Presumably the 6-inch 
separation did not reduce Ро quite to 
zero, since rhesus monkeys can learn 
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a single discrimination with a os 
separation (Stollnitz & Schrier, od 

But р, does not have to be very ob: 
to remain constant within every pr for 
lem. Consider a spatial variant A 
which фе is extremely high. do 
ample, in the typical “100% : a 
discrimination problem, the su 3 у 
must touch the colored surface Ton 
responds at all. Because primates a 
where they put their fingers, ро is КОП 
to 1.0 at the start of —P € 
training. The same is true of аав 
quality discrimination when sub) 
push aside the stimulus p: 
course, the fact that р, is close ei? 
does not imply that discrimination P 
formance starts out nearly ар oe 
with an initially high p, the disc 
nation is learned relatively ae 
a learning set is formed, new Pd 
are solved more and more rapid | 
Ро remains essentially unchange y e 
cannot rise above 1.0 no matter re- 
much the observing response 15 its 
inforced, and it cannot fall below a 
initial value (close to 1.0) as Tn inis 
the subject touches the cues. pe 
Sense, the spatial conditions 
lower limit on Фо. 

What if the initial value ot Pa 
neither extremely high nor eae ) 
low? For example, what if fo | ing! 
at the start of discrimination ey ий 
Note first that a subject could ре оке 

% correct responses on a ET re- 
discrimination with a p, of .50 Tae 


H 1 e 
sponding correctly every time th time 


ems 
but 


is 
ly 


not: 
(chance level) when they e that 
So the simplest assumption stil 


: on 
enough for the observing meP et 
be reinforced, But in a lear 
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experiment, learning is cut off after 10 
trials or even fewer. Warren's (1953) 
within-problem learning curves show 
that. performance differences among 
Spatial variants are still present on 
Trial 10. Enough observing responses 
have been made by then to permit 
Some discrimination learning, but the 
Problem is changed before р, can in- 
crease, Discrimination learning must 
then begin once again for the new pair 
of cues. So, from the simple assump- 
tion that р, remains constant within 
one problem, it follows that pọ remains 
Constant throughout the experiment. 
^ That simple assumption may not al- 
m be the most plausible, however. 
pe Hh the initial value of po, the 
ter each problem would be solved if 
training were continued, and the fewer 
trials would be needed on a given 
Problem before the observing response 
B be. reinforced and f» would 
med à increase. So it could. be 
Within that р, might increase a little 
Bien: a short problem, especially after 
нетра. set has been formed and 
Get able discrimination learning 15 
fiber ihe within each problem. But 
is М, nat happens when the problem 
н Recall that an observing 
Exit d will be reinforced only to the 
tenth ү the subject responds dif- 
Set ES y to the cues. In a learning- 
Probie eriment, the cues of each new 
ential p do not at first control differ- 
Hir e esponding. Exposure to the 
Servin les will fail to reinforce the ob- 
of р E response. Even if the value 
Proben increased slightly during one 
Servin M, it will fall again as the ob- 
the A aec ә extinguishes when 
Will not problem is started. So Po 
Problem, increase progressively from 
Spatial до турше, and the effects of 
Out the ariables will persist through- 
Sort experiment. y 
flecha orate the operation of such a 
*chanism has been demonstrated. In 
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an experiment by Zeigler and Wyckoff 
(1961), pigeons had to step on a pedal 
to expose themselves to cues. This 
observing response was acquired dur- 
ing each of six discrimination prob- 
lems, but was found to extinguish after 
each new problem was introduced. 
The repeated acquisition and extinc- 
tion confirmed Zeigler and Wyckoff's 
prediction from — observing-response 
theory and directly supports the pres- 
ent analysis. 

Should one conclude that р, cannot 
increase progressively during learning- 
set formation? High initial values of 
po long problems, and fast-learning 
subjects might foster such an increase. 
But, to explain the persistence of the 
effects of spatial variables in the ex- 
periments reviewed here, it seems 
plausible to claim that any observing 
response that might have been re- 
inforced within problems extinguished 
again when the problem was changed. 
Thus po was held close to its initial 
level by this repeated extinction. 
After all, the first trial of a new prob- 
lem is always an extinction trial for 
the observing response, so extinction 
always has a head start. 


DiscussioN 


Extinction of the observing response 
when a new problem is introduced has 
been a hidden help to those who have 
trained monkeys on a number of spa- 
tial variants of color-discrimination 
problems using the learning-set proce- 
dure and the treatments-by-subjects 
design. Because such extinction pre- 
vents transfer among variants when 
the cues are changed, this economical 
design has revealed the same spatial 
effects as would be shown if a different 
group of subjects was trained on each 
variant. 

The theory presented here to ex- 
plain the effects of spatial variables is 
formally similar to Restle’s (1958) 
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theory of within-problem pee in 
learning-set experiments. In Rest ез 
theory, irrelevant stimuli ( type-c 
cues”) lose their control of behavior 
(“adapt”) in the presence of (“with 
respect to”) the relevant cues of a par- 
ticular problem ("type-b cues”). As 
a result, the irrelevant stimuli gain 
control again (are “released from adap- 
tation") whenever the relevant cues are 
changed. In observing-response the- 
ory, reinforcement of an observing re- 
sponse by exposure to particular cues 
results in extinction of any acquired 
observing response whenever these cues 
are changed. Some of Restle's con- 
cepts may be identified with the em- 
pirical variables that have been dis- 
cussed in this paper. For example, 
when cue and response are spatially 
separated, there must always be some 
irrelevant stimuli between them. If 
these stimuli are identified with Res- 
tle's type-c cues, then adaptation of 
these cues with respect to the relevant 
cues becomes functionally equivalent 
to acquisition of an observing response 
for the relevant cues, This approach 
Suggests the Possibility of applying 
Restle's logic to an observing-response 
theory of learning-set formation. Res- 
tle accounts for learning-set formation 


by postulating that both type-c and 


type-b cues adapt with Tespect to 
type-@ cues (cues that are valid 
throughout the experiment). This 


mechanism might correspond to 
quistion of an (covert?) observing re- 
sponse for the cues of reward and 
nonreward on the Previous trial, Such 
an observing response, of course, would 
occur in addition to the observing re- 
sponse for the relevant Cues, which 
has been the main concern of this 
paper. 

It is interesting to notice why Restle 
thought that his model of learning-set 
formation could not deal with observ- 
ing-response learning. Finding that 


ac- 
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his model fit the data of Harlow €: 
Warren (1952) poorly, Restle = 
gested that the monkeys in Mk re- 
periment were learning ag se Tes 
sponses that directed their m 
toward the centers and away from But 
borders of the stimulus cards. Ner 
that was the experiment ie quee 
in the introduction of this paper) W i B 
used 3-inch squares cut from us 
zines, which presumably differe en- 
much in their borders as in their i A 
ters. The discriminations were ай 
cult, but not because the monkeys ы 
to fixate the centers of the pans 
Spatial variables are important çam- 
wide variety of situations. For ee 
ple, facilitation of correct "app imt 
by transfer of a cue-touching гез oe 
has been mentioned earlier in yey anc 
tion with experiments by Schr "Miller 
Harlow (1957), Murphy and letcher 
(1958), and Polidora and o 
(1964). Perhaps through trans s 
preexperimental learning, Шей ай 
chimpanzees trained by Gelenn 
(1933a, 1933b) with cues Ser 
from the response loci “spontaneo with 
traced the contours of form cues "m 
their fingers before responding. the 
structions to trace contours with dis- 
fingers similarly facilitated form yeri- 
crimination by children in an eT og 
ment by Ruzskaia (1958), and M noit 
words helps children to p emet , 
them in learning to read (Pe ilar 
1943). АП of these seem to be si they 
cases of primates looking — a has 
put their fingers, rather than (a cu 
often been claimed) of Kinesthet a 
Supplementing or modifying lacking 
Perception. Other mammals, 14 
fingers, look where they put cen? 
noses. "Thus, touching the guen fi 
to be quite generally importe. 
efficient discrimination ро is-7 
(Ehrenfreund, 1948; Gardner 9 ats 
sen. 1948; Munn, 1931)—even 
look before they leap. 


a 
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Atkinson (1961) wrote that more 
experimental data were needed before 
Lie Concepts of observing-response 
neories could be related to empirical 
variables, such as the properties of 
stimuli. At about the same time, 
Schuck et al. (1961) recognized that 
the effects of spatial properties of 
stimuli should be explained in terms of 
observing-response theory. This pa- 
Per is an attempt to satisfy both re- 
quests, 
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NEW DEVELOPMENTS IN FACET DESIGN AND 
ANALYSIS? 
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In multivariate research design the systematic definition of the а 
variables in terms of more basic sets, the facets, leads to the predic п 
of the empirical interrelationship among the variables. 2 prin E 
ples are suggested for predicting the results from the facet чш 
of the variables: the principle of contiguity and the semantic princip 


components. The principle of conti 
more similar in their facet structu 
pirically. This principle does not 
facets in determining the relations| 
the differentiation problem is prov: 
semantic principal component, 

to be the special case of the fir 
tion of these concepts to a num 
areas suggests that they have 
that facet elements can be class 
set of variables and that vari 
to be related to the set of v 
nonspecific elements, Systematic des 
rect prediction of empirical results. 

havior several alternative form 
usually lead to different hy 
design rather than anothe 
vestigator, it appears also 
formation and to the influe 


pr 


The 
nber 


ified i 
ables 


potheses. 


Behavioral studies tend to become 
wider in scope and to in 
number of variables, 
complexity of data 
than ever, Systematic design and a 
rigorous statement of hypotheses, 
Facet metatheory is an approach to 
the treatment of this problem of multi- 
variate design and analysis, It is in. 


clude a larger 
The increasing 
requires, more 


1 The work on general m 
design and analysis descril 
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alizations a 


guity simply states that variables 
re will also be more related em- 


ovide for a differentiation among 


hip. A nonparametric approach to 
ided by the more general concept o 


Principle of contiguity turns out 


st principal component. The applica- 


of studies in different behavioral 


Predictive power. It is further shown 


nto specific and nonspecific to € 
containing specific elements tend 


3 1 n. 
ariables more than variables containing 


ign alone does not guarantee cor- 
In fact, for a given area of be- 
Ppear possible, and they will 
While the choice of a given facet 


r may depend on the intuition of the in- 


to be related to the psychology of concept 
nce of language on this process. 


"i 
tended to help and guide the im 0 
of Ше investigator rather t псер 
substitute for it. А scientific wed 
is constructed or invented by Y^ of 
ing certain features from the 'ertaiti 
experience and disregarding € 


еј 
m ch 4 
Other aspects, In deciding whi pic! 


has 


Same universe of experience 
conceptualized in many | o 
Ways, depending on the po П 
investigator ; and some ae fiere? 
belong to the same or to eb 2 
concepts according to the feature 
chooses to consider. 

Concepts become accepted ү a 
ers, Or become “scientific,” 1150 ric 
they lend themselves to neat e™P 


th- 
y 0. 
al 


Facet DESIGN AND ANALYSIS 


relations, One of the difficulties be- 
setting behavioral science at present is 
precisely that different investigators 
make different choices, thus precluding 
the possibility of comparing the find- 
ings they obtain. Investigators are ap- 
parently spurred to try new concepts 
by the fact that concepts previously 
used rarely yield neat and orderly 
data. А necessary criterion (albeit 
not a sufficient one) for a good theory 
4 x it should lead systematically 
lle correct prediction of empirical 
s. Facet design and analysis sug- 

Best an approach to such а testing. 
ae concepts presented in this paper 
> ееп developed in specific areas of 
Seen The presentation will however 
sien т the formal rather than the sub- 
antive aspects in the hope that this 
ш facilitate the application of this 
a ereach to other areas. But, again, 
i н: no intention to suggest that a 
i. problem can be approached 
uelim purely formal viewpoint. The 
ems ee of design and analysis 
cede stantive theory are closely en- 
likel : and no research problem seems 
in о be solved by considering one 
м ct only (Brunswik, 1951; Ham- 

попа, 1951). 


Facer DESIGN 


mz e that variables included in a 
arms ot design should be defined in 
not е their component elements is 
was к Its use in behavioral research 
fap E DAR by the desire to apply 
eld hows factorial design to this 
1930, ав, 1949). In the early 
а fact ;irunswik and Reiter employed 
iene design in studying the im- 
Seis value of schematized faces 
used ie 1956, pp. 100-108). They 
eing d facial schemes, each scheme 
ele efined by different combinations 
nd ее. of the forehead, eyes, nose, 
aces nouth. For example, one of the 
is defined by high forehead, wide 
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eye separation, medium-tip and short- 
saddle nose, high mouth. Brunswik 
(1956) discusses at length the prob- 
lems related to systematic and repre- 
sentative design and gives many other 
examples of its use, drawn mainly from 
the field of perception. 

A similar approach is proposed by 
Stephenson (1953). One of the ex- 
amples given by him (see pp. 69-70) 
is a redefinition of Jung's psychological 
types in terms of Attitude (introver- 
sion, extraversion), Mechanism (con- 
scious, unconscious), and Function 
(thinking, feeling, sensation, intuition). 
This leads to 2X2X4=16 types. 
One type, for example, is "unconscious 
extravert feeling." 

Guttman (1954b) suggested that this 
manner of defining variables can be 
formalized by using the notation of the 
Cartesian product and introduced the 
term facet to indicate a component set 
of the product. By doing this, Gutt- 
man opened the way for further 
developments. 

To illustrate this notation let us use 
again Stephenson’s example of Jungian 
types. The three facets are: A, Atti- 
tudes, with two elements: a,, intro- 
version, and as, extraversion; В, 
Mechanism, with two elements: bi, 
conscious, and D», unconscious ; and ©, 
Function, with four elements: Ci 
thinking, Cs. feeling, Cs; sensation, and 


с intuition. The Cartesian product 
ABC indicates the sel 


t of 16 types; 
the notation abc: indicates one ele- 
ment of the pro 


duct, namely, conscious 
introvert thinking ; other elements are 
abis аз 


b,c,, and so on. 

Some of the variables thus defined 
are more similar among themselves than 
some others. Variable a,b,c, for ex- 
ample, is more similar to a,b,c. than to 
abaca The first two variables have 
the elements of Facets A and B in 
common, while the first and last vari- 
ables have no common elements. 
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This particular way of detti psy- 

chological types or faces, as in Bruns- 
wik’s study, is only one of the many 
possible alternatives. Different defini- 
tions can be thought of, using different 
facets; and these different definitions 
will show different similarity patterns, 
The process of constructing a concept 
consists in retaining certain facets 
and disregarding certain other facets 
from the data of experience. Facet de- 
sign formalizes this intuitive approach 
by spelling out the facets which are 
retained, but it does not tell, a priori, 
which facets should be spelled out in 
the definition. The choice of facets is 
a substantive rather than a methodo- 
logical problem. Facet analysis per- 
mits, however, a test whether a particu- 
lar facet design produces similarity pat- 
terns which are confirmed by empirical 
results. Thus the conceptual structure 
of the variables may suggest hypotheses 
about their statistical relationship. 


PRINCIPLE oF Conticurry 


A possible relationship between the 
conceptual and empirical structure of 
the variables is Proposed by the prin- 
ciple of contiguity (Foa, 1958). This 
principle states that variables which 
are more similar in their facet structure 
will also be more related empirically. 
Using this principle in our example we 
shall predict that the relationship be- 
tween a,b,c, and a,b,c, will be higher 
than the relationship between a,b,c, 
and a,b,c. 

The use of the notion of ¢ 
or proximity or similarity for predict- 
ing the relationship among variables 
had been limited in the past to the 
rather special case in which the vari- 
ables could be located along some 
physical dimensions, such as loudness 
of sound, brilliancy of light, color, posi- 

tion in the physical space, and the like, 
More often, in the absence of a clearly 
defined variable space, this notion was 


ontiguity 
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resorted to in order to explain iod 
found relationship rather than for he 
dicting it. An illustration of this = 
planatory use is given by d 
(1954a, especially p. 339). 2 4 
explanatory argument the р 
а similarity pattern is inferred je 
the empirical relationship. On * 
other hand, in the principle of ex 
tiguity the empirical relationship is P н 
dicted from the similarity pattern xi 
facet elements. In the facet d 
proximity acquires a precise om 
It is now proximity in terms Ot e 
elements. Thus it becomes possib A " 
use it for predicting the relations 
rather than for explaining it. 4 
The first indication that the principle 


ctive 
of contiguity may have some predi 


he 
power was obtained in a study of t 0 
interpersonal relations between men. 
factory workers and their 51 paw. 
In this study the relationship be ana- 
32 interpersonal variables was aver- 
lyzed. It was found that, on the ments 
age, variables with more facet ele! yari- 
in common correlate higher than nents. 
ables with fewer common = fol- 
The average correlations were x two 
lows: variables with elements © with 
facets in common, .23; variables mon, 
the element of one facet in com ele- 
10; variables with no common 
ments, — .01 (Foa, 1958). ү than 
In fact this study goes бага f 
Proposing the principle of con! c Ву” 
It also suggests the more spec? а 
pothesis that variables having 
facet elements in common will : 
related than variables having 
facet elements in common. р 
ciple of contiguity is necessary t 
hypothesis to be true, but mo n 
cient. To support it, it is m y^ 
sary that the various facets v іп de 
approximately the same weig "Face 
termining the relationship. If, much 
contributes to the relationship рар” 
more than Facets В and C, it may 


more 
fewer 
in 


pis 
or t 
вий“ 


есеб” 
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pen that variables differing only in 
the elements of Facet A will be less 
related than variables differing in the 
elements of both B and C. 
Psi a hypothesis of counting 
from y acet elements was derived 
E he more general principle of 
Bio id (Foa, 1958). The distinc- 
Ee these two notions was not 
i. a however, in the analysis 
the facet structure of a simplex 
(Guttman, 1959). 


Tur 
HE Facer SrRUCTURE OF A SIMPLEX 


p this work Guttman (1959) re- 
тм a study of intergroup beliefs 
mee of 580 white Brazilian re- 
f ents toward the Negro. Three 
acets were used: 


y iet A. The Zevel of intergroup be- 
ath with two elements: a, belief, 
Po overt action. 
uo B. The type of intergroup be- 
Бу Ог with two elements: Ъ,, compara- 
x and b,, interactive. 
а С. The referent to which the 
subjects behavior is ascribed: c,, the 
‚ Jects gr i 
himself. group, and c, the subject 
Т А 
dem Cartesian product of these three 
" ‘ts defines variables such as the fol- 
Wing ones: 
1, Р А 
S авз, Stereotype: belief (a,) of 
Ject that his own group (c,) (ех- 


cels — ; Н 
(b) does not excel) im comparison 
1) to Negroes. 
: abc, Norm: belief (а) of a 


Subj : 
beh that his own group (c,) (ought 
er ү: в not) to interact (b) with 


Беор Hypothetical interaction: 
(cy) (a) of a subject that he himself 
with Мт not) interact (bs) 


egroes. 

acri, Pas Personal interaction : Overt 
(to. (а„) of the subject himself (с) 
to) interact (b) with 
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These four variables are ordered by 
similarity or contiguity: a,b,c, is the 
most similar to a,b;c,, differing only in 
the element of Facet B. Next comes 
а,Ь,с, which differs from a,b,c, in the 
element of Facets B and C. Likewise 
a, b.c, is the best neighbor of a,b,c, and 
of а,Ь,с, and so on. Indeed the four 
variables so ordered form a perfect 
Guttman scale. 

The principle of contiguity now sug- 
gests that Variable 1 will empirically 
relate most highly to Variable 2, less 
to Variable 3, and least to Variable 4. 
Variable 2, in turn, will be more re- 
lated to Variables 1 and 3 than to 
Variable 4, and so on. 

Scores, ordered from favorable to 
unfavorable to the Negroes, were as- 
signed to the 580 white respondents in 
the four variables, on the basis of their 
answers to questions pertaining to each 
particular variable. These scores were 
then intercorrelated. The intercorre- 
lation coefficients confirmed the pre- 
diction, with only one slight deviation.” 

The correlation pattern obtained is 
a simplex, a statistical structure which 
is characterized by the possibility of 
ordering the variables along a line. 
Those nearer on the line correlate 
among themselves higher than those 
more distant (Guttman, 1954a). Itmay 
be of some interest to note that con- 
tiguity alone is not sufficient for pre- 
dicting the simplex. It has to be sup- 
plemented by the more severe criterion 
of counting facet elements. To give 
an example let us compare the rela- 


2 For details on this point see the tech- 
nical appendix to this paper which has been 
deposited with the American Documenta- 
tion Institute. Order Document No. 8249 
Auxiliary Publications Project, 
ervice, Library of Con- 
Washington, D. C. 20540. Remit in 
advance $1.75 for microfilm or $2.50 for 


photocopies, and à k y 
Chief, Photoduplication Service, Library of 


Congress. 
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tionship between the second and 2 
variables, on one hand, with the rela- 
tionship between the second and fourth 
variables, on the other hand. The 
simplex pattern requires that the 
former relationship be higher than the 
latter one. Now the former two vari- 
ables differ in the element of Facet B; 
the latter ones differ in the elements 
of Facets A and C. Contiguity alone 
does not tell which relationship will 
be higher. It may be possible indeed 
that Facet B will contribute to the rela- 
tionship more than A and C combined. 
If, however, one assumes that the vari- 
ous facets have approximately the same 
influence on the relationship, then 
counting the common facet elements 
predicts that the relationship between 
the former two variables (differing in 
one facet only) will be higher than the 
relationship between the latter ones, 
differing in two facets. This is pre- 
cisely the prediction required by the 
simplex pattern, 

In a semantic Guttman scale the ele- 
ments of the facets are related to the 
order in a monotonic Way: As one 
moves along the order, fro 


m the first to 
the fourth vari 


able, the elements of the 
facets change Progressively, and none 
Of them resumes its former values, 
First Facet B changes from b, to b,, 
then Facet C, and finally Facet A. All 
the facets behave as the first principal 
component of the order, 
These four variables are а subset of 
the eight possible variables defined 
by the three dichotomous facets. The 
variables not included in the ordered 
subset are: a,b,c,, a,b,c,, ayb,c., and 
a,b,c, 
By choosing appropriate variables 
from the total set of eight variables, it 
is possible to form other scales, in addi- 
tion to the one which has been dis- 
cussed. Thus several subsets are 
scalable, but the set as a whole cannot 
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be ordered by the principle of conti 
te е », Variable a,b,c, 
guity. For example, pH 
has three neighbors, a,b,c, nee 
a,b,c.; the principle of peg S rs 
not say which one of the neig 
is most related to a,b,c,- dby 
In a set of eight variables, defined d 
three dichotomous facets, the er а 
of contiguity does not suggest aug E 
ing of the whole set; only certain n 
sets can be ordered. Indeed this f p 
ciple does not differentiate omoti ef; 
various facets in determining wee "m 
If all the facets play a similar pie bà 
the order, the eight variables etra 
represented as the vertexes of eee 
the dimensions of the cube рэла, 5 
three facets, Three dimensions pe y 
necessary to order the set, as same 
dimensions as facets. For qb 18 
reason, the principle of pei on or 
sufficient for proposing a circu! снаге, 
der, ie, a two-dimensional stru d by 
in a set of four variables define con- 
two dichotomous facets. Let ү fined 
sider a set of four variables, ba Asi 
by Facets A and B with riri as 
a, and b, and b, respectively: 
follows: 


a,b, 
a,b, 
a,b, 
aab, 


Paa a pt 


to 
The first variable is or 

the second (in Facet A) anc second 
fourth one (in Facet B), ue st and 
Variable is contiguous to the ale to 
the third ones, the third varia 

the second and the fourth, an first 
fourth variable to the third an tiguitY 
Thus, if the principle of or in 
holds, the four variables w1 rrelat€ 
Circle; each variable will o and 
higher with its two neighbor Er 
Ower with the remaining wari ure? 
amples of such cireumplex d т 
have been reported by Сита 
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Press) in the reanalysis of earlier pub- 
lished correlation matrices of intelli- 
Bence tests. In these examples Facet 
A is the type of ability with two ele- 
ments: a,, achievement, and a., analyti- 
cal ability. Facet B is the language 
of communication of the test, again 
EM two elements: b,, numerical, and 
a pictorial. Tests less similar in 
acet structure correlate less than tests 
of more similar structure. Thus the 

Circular structure appears. 
Ne o) principle proposes the 
do g of the variables of the set in 
(di nany dimensions as the number of 
iie. ee facets. It does not con- 
— i possibility that a smaller 
ihr оп] oi dimensions will be sufficient 
Seah the variables. Such a 
ne гі implies | a differentiation 
кы жы a facets in determining the 
the ны facets will contribute to 
и. nination of the order тоге 
пе ain other ones. A nonpara- 
vided h m to this problem is pro- 
Principay he new notion of semantic 
oscillator d (Foa, 1961). The 
opone? unctions called principal 
maticians. s are well known to mathe- 
in the ks have been used already 
ences, peca and behavioral Sci- 
them em ne (1954c) identified 
S ni age in scalable attitudes. 
аз such n муса аге formal and, 
Mur р e employed ina variety 
Such diffe areas without implying that 
ive un rent areas have any substan- 
sists in arity. The novelty here con- 
“адеп E the notion of principal 
ceptual = in the analysis of a con- 
al án Tucture rather than an empiri- 
€, as done in earlier applications. 


Sr 
EM 
ANTIC PRINCIPAL COMPONENTS 


Le 

a e consider a set of eight vari- 

et defined by three dichotomous 
S as follows: 
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> a,b,c, 
. a,b,c, 
а,Ь,с, 
ар 
а,Ь,с, 
а... 
а,Ь,с, 
гаре 


о м бул мн 


There are several different ways to 
order these variables in a circle. The 
problem is how to predict a particular 
order. Let us suppose, for the sake 
of argument, that we predict the above 
order. In this order Facet A behaves 
as the first semantic principal com- 
ponent; it changes value only once. 
Facet B behaves as the second com- 
ponent; it changes value twice. Facet 
C behaves as the fourth component; it 
changes value four times. The pos- 
sibility of a third component will be 
discussed later. 

Reversing the argument one can 
now say that knowledge of the facet 
components tells a good deal about the 
order of the variables, but not every- 
thing. An alternative order, which 
preserves the components, may indeed 
be obtained by interchanging, in the 
above order, Variable 1 with Variable 
2, Variable 3 with Variable 4, and so 


on, as follows: 


a,b,c. 
a3 4€, 
< 84b.6; 
. 84b.€. 
а,Ь,с. 
а„б„с, 
a,b,c, 
agb,Ce 


Мм бош сз ко 


To determine a unique order it is 
also necessary to identify the first vari- 
able of the order. If the first variable 
is a,b,c, then the alternative order 
bove is ruled out. Identification 
ble is also required for 
уееп the first and 


given a r 
of the first varia 
differentiating betv 
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the second components. Since the 
proposed order is circular, one could 
otherwise start, for example, from the 
third variable; then Facet A would 
behave as the second component and 
Facet B as the first one. 

The first variable cannot be identified 
simply by indicating the subscripts of 
its facet elements. If these subscripts 
are assigned arbitrarily, any one of 
the variables can be indicated by a,b,c, 
or by any other combination of sub- 
scripts we wish. It is therefore neces- 
sary to assign subscripts to the elements 
of the facets in a meaningful manner, 

In conclusion: To predict 
order, for the eight variables d 
three dichotomous facets 
sary and sufficient to ide 
Cipal component of each 
first variable of the order, 


An approach to the solution of these 


a unique 
efined by 
› it is neces- 
ntify the prin- 
facet and the 


behavior. Thi 
the next sectio 


STRUCTURE OF INTERPERSONAL 
BEHAVIOR 


( observer), 


Facet B. The level: b 
is done), and h 
be done). 

Facet C. The per. 
of view of whom th 


x actual (what 


» ideal (what ought to 


son from the point 
e action of a given 
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actor is perceived, or alias: c,, the other 
(nonactor), and c,, the self (actor). 


The Cartesian product, ABC, of 
these three perceptual facets defines 
eight perceptual types, such as actual 
behavior of Actor, from the point of 
view of Alias,, and the like. 

The three behavioral facets are: 


Facet D. Content of behavior : d, ac- 
ceptance or giving, and d,, rejection or 
taking away; 

Facet E. Object of behavior: e, ш 
other (nonactor), and e, the sel 
(actor) ; t 

Facet F. Mode of behavior : f,, social 
or status, and fa, emotional or love. 


The Cartesian product, DEF, of 
these three behavioral facets defines 
eight behavioral types, such as emo- 
tional acceptance of self, social rejec- 
tion of other, and the like (Foa, 1961). 

The Cartesian Space for an observer 
is defined by the product ABCDEF. 
Each variable belonging to this gne 
is defined by a profile, over the o 
facets, of the form ajbe.die, f, (19 
Іт = 12). Thus 2° = 64 variables 
defined for each Observer. The struc- 
ture of the ABCDEF space as a whole 
is discussed in the next section. 
Our problem here is how to predict the 
Order of the eight perceptual types: 
When the behavioral type is held con- 
Stant, and the order of the eight E 
haviora] types, when the perceptua 
type is held Constant. 'That is: the DI 
€r of the variables ABC (diêm д) 
for any given Чї and the order О 
the variables (арус) DEF for any 
Siven ас, 


t has been noted 


а 
already that 
necessary con, 


dition for predicting the 
Order is to know the component of each 
facet. "Te is now proposed that these 
facets оғ interpersonal behavior de- 
velop at different stages of the process 
of Socialization jn the child. Among 


=e 


= | = 
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рн facets it is suggested that 
күш between actors will de- 
уге. + followed by differentiation 
"x 1 evels and then between ali- 
ией п the behavioral facets the sug- 
tent T аы of development is con- 
differ bject, mode. Each successive 
bg erum is obtained by a subdi- 
Bis > the previous concept accord- 
the Dal ne new facet (Foa, 1964). In 
eee у эш facets, for example, the 
E erentiation is, by content, into 
e: ri den and rejection. At the sec- 
splits i - each one of these concepts 
ip Үч two new concepts according 
Sedi : acceptance of other and self, 
рь n of self and other. In the next 
ie Sach one of these four concepts 
4.5 dichotomized by the mode 
Site re social and emotional accept- 
his fis rejection) of self (or other). 
ation mies С pattern of dichotomi- 
variable 1886515 a circular order of the 
the se S in which the first facet of 
сіра] quence behaves as the first prin- 
the Ee po the second facet as 
ing E component, and the remain- 
Pn = the fourth component (see 
different: ). The sequence of concept 
to the o n by facets corresponds 
the ж er of the components. Since 
is са of concept differentiation 
not ap ү, the third component does 
ae Odd components are anti- 
clement ic; they have a different facet 
Wen sith the first and last variable. 
аге $ mponents, on the other hand, 
"a cape More generally, it 
When "x suggested tentatively that, 
er ( ne predicted order is а circum- 
Coefficia, circular order of correlation 
ikely ura even components are more 
Variable appear since the first and last 
б Бас аге also neighbors. When 
May he risa simplex, odd components 
simplex more likely. Two examples of 
Which , es, in sets of eight variables, in 
h one of the three facets behaves 

ird component of the order, have 


been found recently by H. Triandis of 
the University of Illinois.? 
| Let us now go back to our study of 
interpersonal behavior. Using the 
components of the facets, as determined 
by the sequence of development, we 
can predict that the circular order of 
the perceptual types may be a,b,c, 
a,b,c, AybeCe, abu, агас, азс, 
a;b,cs, a;b,c;, and the circular order of 
the behavioral types, Фе, Фе, 
d,ef, def, def, def, d.e;f,, 
а,е, Ё. DM 
The Cartesian product of the 8 per- 
ceptual types X the 8 behavioral types 
defines 64 variables of interpersonal 
behavior as perceived by a given ob- 
server. To study the intercorrelation 
among perceptual types, we shall 
choose sets of 8 variables having the 
same behavioral type and observer and 
differing in perceptual type. Like- 
wise, to study the intercorrelation 
among behavioral types, we shall keep 
the perceptual type and observer con- 
stant and let the behavioral type vary. 
These variables were observed in a 
sample of 633 married couples in 
Jerusalem, Israel. The procedure for 
gathering the data has been described 
elsewhere (Foa, 1962). Here it may 
suffice to say that the frequency of 
perceived occurrence of each one of the 
variables was recorded as reported by 
the observer (husband or wife) by 
means of а three-question scale. The 
scale score on each variable was then 


intercorrelated with the score of other 
Sixteen intercorrelation 


computed for the рег- 
holding the behavioral 


variables. 
matrices were 


ceptual types, : 
type and the observer constant. Like- 
wise 16 intercorrelation matrices were 

hold- 


r the behavioral type, 
ual type and the ob- 


(Foa, 1962). 


computed fo 
ing the percept 
server constant 


3H. Triandis, personal communication, 


1964. 
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In all these matrices the ане 
circumplex order is confirmed. | "a 
coefficients are high near the main bx 
agonal; as one moves away from p he 
coefficients tend to decrease and then 
to increase again as the main diagonal 
is approached from the other side (see 
Footnote 2). | 

These examples suggest that if we 
are able to make assumptions about the 
sequence of development of facets we 
may also be able to Propose hypotheses 
about the order of variables defined by 
these facets: earlier developing facets 
will behave as lower components, 

It has been noted already that 
knowledge of the facet components is 
not sufficient for determining a unique 
order. To this end it is also necessary 
to identify the first variable of the 
order. Some considerations leading to 
a possible solution of this problem will 
be introduced in the next section, 


SPECIFICITY OF THE Facer ELEMENTS 


So far we have been concerned 
with the relationship between a vari. 
able and every other variable belong- 
ing to the set. Let us now consider the 
relationship between a variable and the 
set as a whole, The set may be desig- 
nated by a facet which is constant and 
common to all the variables belonging 
to it; in our example this facet is inter- 
personal behavior, 

The set “interpersonal beh; 
be seen as a subset of 
which may be called 
other subset of the 
“personal behavior,” 
defined as the hehavior which does not 
require the actual or potential partici. 
pation of more than one person in or- 
der to occur. 


In some of the facets of interpers 


avior" can 
a larger set 
"behavior," An- 
behavior set is 
which May he 


nal 


behavior one element characterizes 
variables which helong to the interper- 
sonal set only, while the other element 


indicates variables which belong also 
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to the personal set. Thus in m i 
alias, and object facets the ele 
other defines variables belone ed я 
the interpersonal set only, EN in 
element self defines variables w 
also belong to the personal set. jd 

Likewise, the element social o Lun 
facet mode defines variables belonen 
to the interpersonal set only, pi d 
element emotional indicates varia 
belonging to both sets. 

pes ida of the ved ae n E 

is suggested by the clements of е 
maining two facets, content and нее 
Variables with elements accepta al 
and actual must occur in --— a 
behavior, while variables with s 
elements rejection and ideal id 
may not occur. Rejection and "e 
behavior are not necessary for the L 
currence of interpersonal imma ‘ch 

Let us call a facet element w pt 
belongs to the interpersonal yet О a 
or which must occur in interperso 
behavior, a Specific element. — NS 

Thus the elements other, Spei fac 
Ceptance, and actual are specific QU 
terpersonal behavior; while the ed 
ments self, emotional, rejection, * 
ideal are not specific. . dc 
ess technically, the notion of SP 


T + А llow- 
ificity can be described in the uer 
ing manner: Relations with i 
must occur 


in interpersonal bee 
While relations. within sly 
need not. Social behavior is cere! 
interpersonal while emotional prc 
Can occur without reference to ot na 
Acceptance must occur in hene 
behavior if the behavior is to e ue 
to be interpersonal, while a be- 
leads to Cessation of interpersonal. е 
havior, Actual behavior is тира 
егрегзопа1 situation while 


e 
" : an b 
Concept of the ideal is not and са 
maintained in 


In e 
elemen 
behavi 


obviously, 


in an int 


à covert cognitive a m 
ach of these comparisons Aal 
t associated with anjer pen is 
or is labeled “specific.” 


— 
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now proposed that this classification of 
the elements of the facets is helpful in 
predicting the multiple correlation be- 
tween a variable and the other variables 
ot the set. Given two variables, one 
including the specific element and the 
other one the nonspecific element of a 
given facet, and with the same element 
composition in the other facets, the 
multiple correlation of the variable with 
bs specific element will be higher than 
he multiple correlation of the variable 
ee the nonspecific element. The 
1 cm support this hypothesis (Foa, 
). For example, when the ele- 
ment acceptance appears in the variable 
its multiple correlation is always higher 
im the corresponding multiple corre- 
ation of the variable in which the 
“ement rejection appears. In the other 
сев еге are some deviations from 
е) P вн but the hypothesis 15 
i ificantly sustained by the results of 
sign test (see Footnote 2). 
€. approach to the analysis of the 
in ationship between a variable belong- 
à to a set and the set as à whole 
ggests that some facet elements are 
Specific to this set while some others 
“a not. Variables containing the spe- 
à © element are more strongly related 
O the set than those which do not 
contain it, 
не... nonspecific elements may be 
in asa link between a particular area 
n other neighboring arcas. As such 
tl y may prove helpful in clarifying 
Ne relationship between different areas, 
ны, ultimately, to more general 
To constructs. 
hit each facet one element is 5р 
tlie 4 the other is nonspecific. Thus 
Єй elements сап be ordered ac- 
nein to their specificity. Let US 
ins the subscript 1 to the specific 
eiit and the subscript 2 to the 
се an. one. This convention has 
stra followed in defining the facet 
cture of interpersonal behavior. In 


ecific, 


this manner a meaning has been given 
to the subscript; it indicates the spec- 
ificity of the element. It is now 
possible to obtain a unique order of the 
variables determined by the principal 
component of the facets. To this end 
it is sufficient to start the order from 
the variable containing only specific 
elements, 1.е., from Variable a,b,c, in 
our notation, and to continue with the 
other variables according to the prin- 
cipal components of the facets. This is 
precisely what has been done in pro- 
posing an order for the perceptual and 
behavioral types of interpersonal be- 
havior. The alternative order would 
be obtained by starting with Variable 
a,b,c» Further study is required to 
find out when to predict one order or 
the other one. For this particular case 
of interpersonal behavior the meaning 
of the order which has been chosen is 
quite clear. It just proposes that, in 
the perceptual types, the ideal and 
actual perceptions will be more related 
for the alias of the actor than for the 
alias of the nonactor. In the behavioral 
types the behavior toward the self and 
toward the other will be more related 
in the emotional than in the social 
mode. More generally, given the four 
variables а, 0.8, ась, 80:62, апі 
a,b,c, two variables differing in the 
element of B and alike in C will be 
more related when the element of C is 


than when it is specific, 


nonspecific, Cs, 
b,c, are closer 


c,. That is, a,b,c, and a; 
than a,b,c, and а, Б.с. 
Specificity provides а 
ground for relating the components of 
the facets to the order of the variables. 
Moving along. the order the specificity 
of the first facet changes only once. 
In the second facet it changes twice and 
jn the last facet four times (see Foot- 
note 2). | 
Another application of the notion of 
specificity is in predicting the empirical 
structure of the ABCDEF set as а 


common 
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whole. This notion, indeed, lends itself 
easily to a structural interpretation. 
Specific elements belong to the 
ABCDEF space only; it seems, there- 
fore, that they will be found in the 
variables situated in the central portion 
of this space. Nonspecific elements, on 
the other hand, belong to other spaces 
as well and are therefore likely to 
appear in the variables situated on the 
periphery of the ABCDEF space, 
nearer to other spaces. This struc- 
tural interpretation of specificity is 
supported by the finding that specific 
variables are related to the set as a 
whole, more than nonspecific ones, It 
seems therefore that the ABCDEF 
structure will be such that specific 
variables will occupy a central position 
with respect to nonspecific ones, 

It has been seen едг 


empirical circumplex, 
the whole ABCDEF 
of intersecting circumplexes 


either a hyperspheroidal surfa 
anchor-ring Surface. The anchor ring 
or torus is а figure 


с Shaped like the 
inner tube of a car, 


Ina spheroidal surfac 
tion between cent 
not apply: None of 
a surface seems to b 


This conf 


ce Or an 


nonspecific ones, үү 
tion is possible, as in 
dicted structure js 
In this figure variables on 
part of the surface (wher 
valve is found in a car tube) 
central than those situated on 
part of the surface, 

call a statistical struct 
arranged on the surfa 


nor ring, 
the inner 
е the air 
are more 
the outer 
We propose to 
ure of variables 
ce of an anchor 
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ring а ringex. Thus the notion 
specificity, and its structural ше з 
tation as centrality, leads to the pre “a 
tion of a ringex structure for th 
ABCDEF space. 

In this Pee the perceptual eee 
of the form ABC(dienfn) constitu 
small circles on the section of the Fi 
While the behavioral subsets Саби 
DEF constitute large circles € 
the other direction of surface, acros 
the perceptual circles. 

Each (rie circle crosses a 
perceptual circle once, and this Ир: 
of intersection locates a variable of 
form асе. Ё. 
he ringex hypothesis suggests a 
the Perceptual circles will be sma р 
than the behavioral circles. Aroak 
behavioral circles those шге, ris 
the inner portion of the surface wil t 
the smallest, and the size of the S 
circles will increase as one moves E 
the inner to the Outer portion. „йе 
test these Properties of the eut 
average correlation among varia з 
belonging to each given circle wu 
puted. This correlation may be inte С 
Preted as inversely related to the a 
of the circle, The results fully ee a 
the hypothesis, Tn particular, the 35 
age correlation of the behavioral ci 


: t 
cumplexes (abc) DEF is Бена 
When the elements аск are spect 
and b 


comes progressively lower 9* 
the elements change to nonspecific. га 
Pecific variables are more гз 
than nonspecific ones, and the distan 5 
1008 Таман ех frisranses ав опе mor 
rom the inner to the outer portion 9 
the ringex, - 
he notions of sequence of develer 
ment of facets and of specificity | - 
facet elements have led to the ped 
tion of Certain empirical structures, E 
circumplex and the ringex, respe h 
tively, When the facet design is E 
that these Notions do not apply; + 
empirical Prediction may be differer 


Facer DESIGN 


If the facets do not develop in a sequen- 
tial manner, an ABC design may pro- 
duce a cubex structure rather than a 
circumplex. If specificity does not ap- 
ply, we may have a spherex instead of 
aringex. Thus these notions may pro- 
vide criteria for differentiating among 
different empirical structures resulting 
from formally identical facet designs. 


Facets AND CATEGORY FORMATION 


" In the examples of studies described 
ere facet design has led to the correct 
n of . empirical result. It 
К d be possible to quote other ex- 
pue in which this did not happen. 
ү: ешш design alone does not guar- 
лед, that the data will sustain the 
" pes This is hardly surprising 
oem of the fact that alternative 
o gns are usually possible, leading 
different hypotheses. 
en may be suggested that correct pre- 
nd is more likely to occur when 
Жөн 18 а certain correspondence be- 
ü E the design devised by the inves- 
bw or and the manner in which the 
verse of experience being investi- 
i pes is categorized by. the subject. 
of Ме equivalence is a result 
E е coding operations of the subject 
үт than a property of the stimulus, 
fects the problem of inventing а satis- 
do Ory facet design appears to be 
be Sely related to the psychology of 
t formation. It becomes rele- 
a to ask with Bruner (1956, p. 8) 
a different conditions, situational, 
ee in the personal history of the 
tio ject may lead to a given categoriza- 
» " rather than to another one. Some 
for he experiments described by Bruner 
йт investigating problems of concept 
mation have a straight facet design. 
DM experiments, for example, are 
[m on a Cartesian product of four 
42 5, each having three elements (p. 
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Bruner recognizes the influence of 
language in category formation and 
suggests that individual differences in 
categorizing may be found in affective 
responses developed in early childhood 
before the acquisition of language. 

On the other hand, when the lan- 
guage provides a name for labeling a 
facet, this fact in itself may indicate 
that such a particular manner of classi- 
fying experience has gained acceptance 
beyond the experienced world of a 
given individual. Nevertheless, the 
development of language may blur 
the original pattern of concept for- 
mation. Zipf (1935) has shown that 
in several languages the length of 
a word tends to be inversely re- 
lated to the frequency of its usage. 
When the “automobile” becomes the 
"car," the identity of the component 
elements is lost. This may be one of 
the reasons why, in Indo-European 
languages, the sound used to designate 
a concept rarely suggests the compo- 
nent elements of the concept. Excep- 
tions are such multisyllabic words as 
chairman, newspaper, and underwear. 
In other languages, such as Chinese, 
new concepts may be indicated by com- 
bining the sounds of the component 
elements, which is equivalent to using 
multisyllabic words whose total mean- 
ing is a function of the meaning of their 
syllables. This latter type of language 
may present fewer obstacles in laying 
bare the mechanism of concept forma- 
tion than do Indo-European languages. 

One may suggest that the unificatory 
role of language may be lost in cross- 
cultural research. Yet it appears that 
a concept found in one language may 
usually be rendered in another one by 
a phrase, if not by a word (Brown, 
1956). It is therefore possible that 
certain basic facets may be common to 
different cultures. Some evidence is 
available on this point. 
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The study of intergroup beliefs and 
action which has been summarized in 
this paper was originally done in 
Brazil. The same design was later 
used in France with similar results 
(Foa & Guttman, 1962). 

In studying the structure of inter- 
personal behavior it has been shown 
that the same order of intercorrelation 
appears in respondents with a European 
and a Middle Eastern background, al- 
though the size of the correlation may 
change (Foa, 1964). 

As we gain more knowledge in the 
process of concept attainment and in 
the role played by language in this 
process, we may be better able to pro- 
duce satisfactory facet designs, On 
the other hand, facet design and analy- 
Sis may prove useful in advancing 
knowledge in this area, as well as in 
other behavioral investigations, An 
assessment of the value of facet design 
and analysis, as an approach to multi- 
variate research, will require 
applications. The Purpose of 
paper has been to give д view of 
present situation and to su 
the results obtained so far 
rant its further use, 


more 
this 
the 
Egest that 
may war- 
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THE MOTOR THEORY OF SPEECH PERCEPTION: 


A CRITICAL REVIEW * 


HARLAN LANE 


Camie Rewir 
enter for Rescarch on Language and Language Behavior, 


University of Michigan 


The motor theory of speech perception maintains that articulatory 


movements and their sensory feedback mi 
stimulus and the perception of speech. 


ediate between the acoustic 
The theory is based on exami- 


nation of changes in identification probability, identification latency, 


and discrimination accuracy effected by change 


stimuli. This paper reviews first those experimen 
vidence is presented: It is shown that 


the theory, then opposing e 


identification and discrimination functions for nonspeech stimul 
muli, when obtained under comparable 


differ from those for speech sti 
conditions. 


a motor theory of speech per- 
Кон maintains that “articulatory 
ел see and their sensory effects 
Bd r? etween the acoustic stimulus 
is e event we call perception 
[Liberman, 1957, p. 122]." The ob- 
Served sequence of events in a speech- 


Гое episode may be represented 


Ry > Sa = Ro; 


as response, Ry, generates an 
discri p stimulus, Sa, which leads to a 
ing a nl response, Rp. Accord- 
ceptio the motor theory of speech per- 

n, the expanded sequence of 


Mp. s 

hs Thig investigation was supported in part 
T iy Language Development Section; 
ee States Office of Education (Contract 

Sumber SAB-9265), and by the National 

D ce Foundation (Grant Number 05586). 


- Cross and B. A. Schneider have col- 
ch and 
К in this article. I have also 
aski by the comments of A. M. Liberman, 
draft ns Laboratories, who read an earlier 
тш first introduced me to the analysis 
indere, discrimination. But Iam chiefly 
Or ed to N. J. Peterson, Yale University, 
Feng аў suggestions, criticisms, and cor- 
ons 


lab : 
its Orated in the conduct of the resear 


s in synthetic speech 
ts cited in support of 


i do not 


events is: 


Ry TE S4 Rp 
T3 


Ry’ = Sp 


the acoustic stimulus leads to a covert, 
articulatory response, Ry’, whose pro- 
prioceptive feedback, Sp, leads to the 
discriminative response. 

The inference of mediating articula- 
tion in speech perception has appeared 
in several experimental contexts, in- 
cluding discussions of phonetics and 
phonemics, the discrimination of syn- 
thetic speech stimuli, the electromyog- 
raphy and acoustics of speech produc- 
tion, psychophysical functions for 
speech parameters, automatic speech 
recognition, disorders of speech and 
hearing, first-language acquisition, and 
second-language pedagogy.” Although 


articles whose find- 


2 Та addition to those 
the validity of the 


ings directly concern 
mediation hypothesis and are discussed in 
this review, the following articles have cited 
the hypothesis in discussions of theory or 
experimental results: Stevens (1960) : 
“Thus, in the synthesis process . - » a repre- 
sentation of the signal at the articulatory 


level will certainly occur . a similar 


representation may likewise exist at some 


275 


HARLAN LANE 
276 


the hypothesis that speech is ) wee 
by reference to articulation has no 
been elaborated into a formal theory, 
its recurrence and central role in these 
discussions suggest that a critical re- 


stage during the reverse process о 
recognition [p. 537”; Fant (1963) : 
the reference to articulation primaril 
a function within the metala 


f speech 
“To me 
у serves 


discern forei 


Y ear without 
aving mastered their Production | |. [p. 
“ 


imilarly, we may 
listens to Jill, his 


++ in learning a foreign 
anguage one has considerable difficulty 
hearing Correctly until 1 
nounce co; 


word pairs 

altered in p 

ing when related to revious Psycholinguistic 

Tesearch Suggests that Soun discrimination 

ability is а function of articulation 
” 62) . 


[ 


Te ап Organic defect pre- 
vented the acquisiti € motor skill 


language but 
evidence was Presented for + е acquisition 


lls as require 
plete understanding of 1 


anguage [p 4241”; 
also see Chistovich, Klaas 
р. 173); Delattre (1958, р, 228); р 


(1958, р. ; Lane (1965, т. 32); and 
Thurlow (1963, p. 469), 


f 
view is warranted. The us es 
this article is to bring toget i find- 
examine the various стаи ав 
ings that have been pe нонат per- 
favoring a motor theory of xt ndis 
ception, to present opp te on Gat 
and interpretations, and to sho idence, 
in the light of the available ee 
the postulation of a special oe of 
mechanism for the discriminati 
speech is unjustified, ге been 

The research findings that Bay hese 
interpreted as favoring this verbal 
have in common that the overt h re- 
behavior of the listener (Ro) ee 
Sponse to speech is more close ‘tation 
related with the patterns of articu those 

v) that typically produce soustic 
Speech signals than with the a this 
Signals (Sa) themselves. Fror orre- 
observation it is inferred that (Ви 
sponding patterns of articulation oustic 
actually mediate between the e the 
Signals and the Overt reports O nals 
listener? Since the speech res 
Generated by articulation are p (Fant, 
able from anatomical data ( per- 
1960), a Correlation between A age 
Ceptual response and the pene 
ulus necessarily implies a Tae 

etween the Perceptual aoe gE 
the pattern of articulation that g mere 
ated the stimulus. Therefore, pei 
Correlation of perception with ar ноп 
tion does not Warrant the introduc the 
ofa Mediating process in analyzing to 
istener’s behavior in IESP ОШБУ 
Eeeeh Signals, Ta order to esta the 

S Usefulness of this inference, tha 
experimenta] evidence must show 


д the 

*In the most Tecent explication Poe 
theory, Liberman, Cooper, Harris, mediating 

x 63) Suggest that the m move 
be one or two steps man 8 
Articulation: “The neural o il the 
Ives, or rather their саара 
central nervous system, might be ne of 
Provide the Teference system in sa carried 
which the decoding [of speech] is 
out [p, 8].” 
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the perceived similarities and differ- 
ences of a set of speech sounds corre- 
spond more closely to the articulatory 
than to the acoustic similarities and 
differences among the sounds. In 
other words, Ry’ is inferred when there 
15 а greater isomorphism between 
changes in Ry and Rp than between 
Sa and Rp. 

There are fundamentally three kinds 
of evidence, therefore, that have been 
viewed as favoring a motor theory of 
Speech perception; these serve as the 
framework for the present discussion : 


*In linguistic terms, Liberman, Cooper, 
Harris, and MacNeilage (1963) have put it 
this way: “The articulatory reference theory 
[rests] empirically on the evidence which in- 
dicates that the relation between phoneme 
and articulation is more nearly one-to-one 
than is the relation between phoneme and 
acoustic signal [p. 7].” 


4000~ 


Fig, 
Were 
Liber, 


-l. The frequency positi 
Combined (as in C) and 
man et al., 1952.) 


ons of the schematic stops (A 
converted into soun 


(a) when diverse perceptual responses 
are evoked by a set of acoustically simi- 
lar speech signals produced by diverse 
patterns of articulation, (b) when simi- 
lar perceptual responses are evoked by 
a set of acoustically diverse speech sig- 
nals produced by similar patterns of 
articulation, and (c) when both articu- 
lation and the acoustic signal change 
continuously over some range, and 
changes in the perceptual response 
correspond more closely to changes in 
articulation. 


Evmence THAT Favors A MOTOR 
THEORY or SPEECH PERCEPTION 
Similar Acoustic Stimuli, Diverse Per- 

ceptual and Articulatory Responses 
Over the past few decades, scientists 

at the Haskins Laboratories have 

greatly advanced our understanding 


о Ol 0203 04 


TIME (SECS) 


as in ask 
(Eastern) 
as in 
jaw 
as іп 
tooth © 


) and vowel formants (B) that 
d for identification by listeners. (After 
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of the perception of speech by experi- 
ments involving synthetic speech stim- 
uli, generated by a device called Pat- 
tern Playback, which permits careful 
control of stimulus parameters. 
Among the earliest studies whose find- 
ings are seen as favoring a motor 
theory of speech perception is an inves- 
tigation of the perception of unvoiced 
stop consonants by Liberman, Delattre, 
and Cooper (1952), employing the 
Pattern Playback to convert hand- 
painted spectrograms into sound. Fig- 
ure 1, Part C, shows one of the Sche- 
matic syllables that was converted into 
sound, and Figure 1, Parts A and B, 
Shows the two acoustic variables of the 
experiment: the frequency locus of a 
Schematic stop and the formant fre- 
quencies of the following vowel, The 
12 stops before each of the 7 vowels 
gave a set of 84 consonant-vowe] (CV) 


5 For à description of Pattern Playback 
and its principles of Operation, see Cooper 


(1950, 1953 у 
Borst (1931), and Cooper, Liberman, and 
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Syllables that were presented to sub- 
jects for identification. 

When the frequency locus of X; 
Schematic stop was held constant «ie 
the following vowel was varied ove 
the seven pairs of formant neges 
shifts in the modal consonant Men 
cation were obtained. For examp < 
the schematic stop at 1,440 cycles pe 
second was called /p/ (59 times out h 
60) before /i/, but it was increasing у 
often called /k/ as the following bans 
ranged from /i/ to /a/; it was са = 
/k/ (57/60) before /a/ but poete 
ingly often called /p/ as the ager 
vowel ranged from /a/ to /u/; 50) 
Same stop was called /p/ (42/ 
before /u/. : 

In order to account for the — 
consonant identifications evoked ge 
Set of invariant schematic stops, 


investigators pointed to the arem 
patterns of articulation necessary 
generate the natural-language CO 
lates of their acoustic stimuli : 


rre- 
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75 


50 


25 


PER CENT IDENTIFICATION 


6 7 8 9 10 
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Lodi 


п 12 13 14 


STIMULUS VALUE 


Гіс, 3, 


shown in Figure 2. 


Wi 

at ia mouth set to articulate /i/ or /u/, 
of the = would presumably use a movement 
Fe Ев to produce a sound having roughly 
schematic acoustic characteristics as our 
аў, a C stop at 1440 cps; for the vowel 
SUE € same acoustic result would be most 
of the depprosimated by lowering the back 
the pM. from a point of contact with 
р. 513] of the mouth [Liberman et al., 1952, 


ie authors then suggest that these 

erative patterns of articulation are op- 

Жапы] during acoustic reception to 

c5 ate acoustic similarity into per- 
Ptual diversity : 


v 
coe isum, then, that the sound patterns 
and она to the syllables pi (or P") 
the list Set off the appropriate movements 1n 
Ments nis (that is, the articulatory move- 
atte; Кы hich the listener would make in 
terns), ing to reproduce these acoustic pat- 
Neural Or perhaps only the short-circuited 
the res nents of these movements, with 
ah ut that the initial schematic phonemes 
Acoustic « Which can be entirely identical as 
in DUE Stimuli, become clearly differentiated 
na Е terms and аге therefore 
P. 514]. eived as distinctly different sounds 


Identification of the consonants synthesized with the spectrographic patterns 


and /g/ i (Each point shows the relativ 
8/ in 32 presentations of each stimulus to one listener. After Liberman et al, 


e frequency of the responses /b/, /d/, 
1957.) 


Several more recent experiments by 
Liberman and co-workers have also 
examined labeling responses to syn- 
thetic consonant-vowel pairs, and have 
provided support for the motor theory 
of speech perception. The five studies 
summarized below have in common 
that the acoustic parameters of the 
stimuli that controlled consonant iden- 
tification all varied regularly over some 
range, while the parameters of the fol- 
lowing vowel were held constant. 

Liberman, Harris, Hoffman, and 
Griffith (1957) synthesized a series of 
14 speech stimuli, using the patterns 
shown in Figure 2, which varied only 
in the direction and extent of second- 
formant transition. When the auditory 
stimuli were presented in irregular 
order to a group of listeners for iden- 
tification, the labeling functions that 
were obtained resembled the one shown 
in Figure 3 for a typical subject. The 
relatively steep gradients that define 
the phoneme boundaries /b/—/d/ and 
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100 


a 


PER CENT IDENTIFICATION 
m a 
e о 


[9 
20 


60 


80 100 120 


SILENT INTERVAL IN MSEC 


Fic, 4. Identification of synthetic s; 
the duration of the silent inter 
frequency of the геѕропѕеѕ /b/ 
From Liberman, Harris, Eimas, 


/d/-/g/ are seen 
favoring the infe. 
ticulation. The 
that, in natural ] 
three phonemes 


val between s 


"partition" the stim 
flect diverse labelin 
ever, the speech s 
an acoustic conti 


ceptual Tesponses, 
A later study by Liberman 


‚ Е 

action 0 

peech stimuli as /rzbid/ ог /repid/ as a fur 
yllables, 

and /p/ in 32 presenta 

Lisker, & Bastian, 1961.) 


relative 


1 an 
(Each point shows the mea listeners. 


tions of each stimulus to 12 


:cal sub- 
The labeling function for a po /do/ 
ject shows greater than + an 
identification for Stimuli mulus 5, 
(22,5% /to/), 80% at Stimul y 
14% at Stimulus 4, 5% at Stimuli 
and no /do/ identifications a honeme 
6 and 7. “For all Ss, the Pis quite 
boundary between /d/ and /t/ e of 10 
Sharp; in every case a аел uek a3 
msec, in the first-formant cu s from 
Sufficient to shift the response’ fhe 
75% /d/ to 75% /t/ [р. 583], оне 
near dichotomy їп labeling voiceless 
Corresponds to the voiced-v ia. 
dichotomy in classifying the natura 
tions that produce /d/-/t/ 21 


language, s, СОП" 


Eimas, 
loye 
Lisker, and Bastian (1961), emP d/- 


the phonemic contrast ипи were 
repid/. Twelve speech stin ilent 19° 
Synthesized with intervocalic 140 milli- 
tervals that varied from 20 tC ilisecon ‘ 
Seconds in steps of 10 d frequen? 

igure 4 shows the relative hese S! с 
of /b/ and /p/ responses to t estigat™ 
uli by 12 listeners, The inv 


Motor THEORY or 


conclude from these data that, once 
again, “the phoneme boundary is rea- 
Sonably sharp [p. 183]." As in the 
Prior study, the corresponding dichot- 
omy in classifying articulation of the 
Speech stimuli in natural language is 
Voiced-voiceless. 

The next pair of studies provides 
additional evidence of “categorical per- 
ception” of speech stimuli and also 
dlectromyographic data to confirm that 

€ patterns of articulation that gener- 
ate these stimuli are similarly cate- 
Sorical Bastian, Eimas, and Liber- 
man (1961) inserted silent intervals of 
Varying lengths into natural recordings 
ОЁ the word /slrt/ and presented them 
to listeners for identification as either 
/sht/ or /split/. “The perception of 
this continuum can be regarded as es- 
Sentially categorical. . . . The categori- 
Cal perception of the consonants may 

€ explicable in terms of the categorical 
nature of their articulatory gestures 
yes, Me absence or presence of the 
Шама] plosive; р. 842]." 
arris, Bastian, and Liberman 
(1961) have shown that, when subjects 
Lug instructed to mimic stimuli 
p, PM from the /slit/-/spht/ con- 
N uum, the silent intervals in their re- 
Ponses fell into a bimodal frequency 
sistribution ; they were, in other words, 
= кашу categorical. To determine 
Vhether there were incipient /p/ move- 
Ments in the region of the phoneme 
pe electromyographic record- 
es were made during mimicry ; these 
2 also entirely categorical: "Either 
а Was a normal burst of muscle po- 
ite ial at the lip (indicating а /р/ 

Sture) or there was not [p. 842].” 

bem, each of the phoneme-labeling ex- 

ments described above, the investi- 
oa also presented their listeners 
Id the synthetic speech stimuli ar- 

E in ABX triads, in order to 
sure percent-correct discrimination 


With; А 
thin pairs of stimuli drawn from the 
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speech continuum. In accordance with 
the motor theory of speech perception, 
the experimenters expected to find that 
two stimuli drawn from the same 
phoneme category (defined by the 
labeling gradients) are less often cor- 
rectly discriminated than two stimuli 
drawn from opposite sides of the pho- 
neme boundary. The rationale is 
spelled out by Liberman, Harris, 
Eimas, Lisker, and Bastian (1961) : 


We believe that in the course of his long 
experience with language a speaker (and 
listener) learns to connect speech sounds 
with their appropriate articulations. In 
time, these articulatory movements and their 
sensory feedback (or, more likely, the cor- 
responding neurological processes) become 
part of the perceiving process, mediating be- 
tween the acoustic stimulus and its ultimate 
perception. When significant acoustic cues 
that occupy different positions along a single 
continuum are produced by essentially dis- 
continuous articulations . . . the perception 
becomes discontinuous (i.e., categorical), and 
discrimination peaks develop at the phoneme 
boundary [p. 177]. 


The experiments to be described, 
which examined ABX discrimination 
of synthetic speech stimuli, had two 
further objectives. Their second ob- 
jective was to see to what degree the 
data warrant the extreme assumption 
that “the listener cannot hear any dif- 
ferences among these sounds beyond 
those that are revealed by his use of 
the phonemic labels [Liberman et al., 
1952, p. 359]." This outcome may be 
expected if both the labeling responses 
and ABX discriminations are entirely 
dependent on the proprioceptive feed- 
back from the mediating articulation 
hypothesized by the motor theory of 
speech perception. The contrary as- 
sumption, namely that the listener can 
discriminate among many more stimuli 
than he can identify absolutely, is in 
accord with psychophysical data on the 
perception of nonspeech stimuli (eg. 


Miller, 1956). 
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Fic. 5. Discrimination of the synthetic s 
in ABX triads to one listener. 
for this subject, shown in Figure 3, 
frequency of correct discriminations ir 


1957.) 


The third and last Objective of the 
discrimination studies that bears on the 
motor theory of speech perception was 
to assess the degree to Which the dis- 
crimination functions reflected “ас- 
quired similarity," "acquired distinc- 
tiveness," or both: 
One Possibility [aeq 


in the raw these sp 
highly discriminable 


uired similarity] is that 
eech stimuli were all as 
as the most discriminable 
ffect of our linguistic ex- 


The motor theory 


15 seen by Libe 
latter outcome: 


of speech Perception 
rman as favoring the 


у articulatory 
responses. . . | үү Suppose that 


the perceived difference between two rela- 
tively similar external stimuli Could be in. 
creased if we could attach to those stimuli 
two very different mediating responses and 
hence gain the added distinctiveness of their 
very different proprioceptive returns [p. 123]. 


Discriminations among synthetic 
speech stimuli were measured, in the 


^ zs esented 
реесһ stimuli shown in Figure 2, when pr 
CThe predicted 


Each point 


^ dients 
functions are based on the labeling Be av 
on the obtained functions shows the 


$ ‘herman et al» 
n 42 presentations of each triad. After Liberma 


first of the phoneme-labeling expe 
ments described above (Тл! {һе 
et al., 1957), by arranging pairs X 
14 stimuli in triads of the form dure, 
and ABB. In this “ABX” proce the 
the first stimulus in a triad, A, 15 ech 
i stimulus in the series of 14 pes 
sounds, the second stimulus, В, i5 jo- 
i+ 1% or i 4- 2th or ў 4 3 (one, Sid 
and three-step comparisons, ү) 15 
tively), and the third stimulus, {lus 
either the A stimulus or the B ol 
—the subject is required to say W ined 
The discrimination functions yc 
for a typical listener are shown in with 
ure $. Comparison of these abe in 
the Corresponding labeling gradie! dis- 
Figure 3 shows that the listener veen 
criminates more accurately к the 
Sounds that lie on opposite sides ^ be- 
phoneme boundary than he doe same 
tween sounds (separated by the from 
number of physical units) drawn 
within one phoneme category. ee to 
In order to assess the Se are 
Which the listener “cannot veo d by 
ences . . | beyond those revea dicte 
the phoneme labels," a set of pre rive 
discrimination functions was de the 
from the labeling gradients, ОП 
basis of three assumptions : 
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1. Two speech stimuli are discrimi- 
nated only to the extent that they are 
labeled differently (assigned to differ- 
ent phoneme classes). 

2. The phoneme classes to which the 

stimuli are assigned during the labeling 
task are the same as those to which 
they are assigned (by mediating proc- 
€sses) during the discrimination task, 
and the relative frequencies of assign- 
ment are the same in the two cases. 
3. ‘The probability of assigning the 
Stimulus in a discrimination triad to 
the jt phoneme class is independent 
of the assignments for the other stimuli 
їп the triad. 


ith 


Figure 5 shows that, in the case of 
the /b/-/d/-/g/ continuum, the dis- 
crimination functions predicted from 
пе labeling gradients correspond in 
°rm to the obtained functions but lie 
ав below them. The authors 
_ c ude that listeners discriminate the 
Ever stimuli largely to the extent that 
a m D them differently, but they are 
dió O extract some information in ad- 

‘on to that reflected in the labeling 
&radients, 
th In order to assess the degree to which 

© speech-descrimination functions re- 
tinc tired similarity, acquired dis- 
ауе муе or both, it is necessary to 
herent line of discrimination which 
of the ably reflects the discriminability 
hn берй stimuli “in the raw"—that 
Басе E: language learning has taken 
бп ds he desire for a base line that, 
à dults one hand, can be obtained from 
Speech, who | already have acquired 
other 1 discriminations, and, on the 
= inge Me has some face validity as 
tine aS of discrimination in its pris- 
Patibje 1» places two mutually incom- 
tro] ст шеген оп the set of con- 
Де, yee used to obtain this base 

* tirst, the control stimuli should 
Comparable to the speech stimuli, 
Ying in the same acoustic properties 
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and with the same constant parameters, 
so that the difference in discriminabil- 
ity of the speech and control stimuli 
may not be attributed to a difference 
in the psychophysical task ; second, the 
control stimuli must not evoke the 
speech discriminations under study, so 
that their discriminability may indeed 
serve as a base line for assessing the 
effects of training in the speech dis- 
crimination. In practice, the control 
stimuli that are employed represent a 
compromise between these two sets of 
requirements; usually some of the 
constant parameters of the speech stim- 
uli are altered so that the transmuted 
series is not perceived as speech but 
varies in the same parameter as the 
speech stimuli, 

Evidence for acquired distinctiveness 
of the speech sounds, which is inter- 
preted as favoring the motor theory of 
speech perception, could not be ob- 
tained in the study of the phonemes 
/b/-/d/-/g/ described above because 
a suitable set of control stimuli was not 
prepared. In experiments with the 
contrasts /do/—/to/, /rzbrd/-/repid/, 
and /slit/—/split/, summarized below, 
the obtained discrimination functions 
provided all three kinds of evidence 
favoring the motor theory of speech 
perception: increased discriminability 
at the phoneme boundaries, close cor- 
respondence between obtained and pre- 
dicted discrimination functions, and 
much greater discriminability of the 
speech stimuli than their nonspeech 
controls, showing acquired distinctive- 
ness. 

In the experiment employing speech 
stimuli that ranged perceptually from 
/do/ to /to/ (Liberman, Harris, Kin- 
ney, & Lane, 1961), the stimuli were ar- 
ranged in one-, two-, and three-step 
comparisons in ABN triads of the 
form: ABA, ABB, BAA, and BAB. 
The discrimination functions obtained 
from 11 listeners agreed well with 
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those predicted from their labeling 
gradients. "Discrimination was better 
for a given stimulus difference. when 
the two stimuli lay on opposite sides of 
a phoneme boundary than when the 
stimuli were within the same phoneme 
category [p. 384].” 

In order to obtain a set of non- 
speech control stimuli that differed 
with respect to cues comparable to 
those defining the speech continuum, 
the authors inverted their speech spec- 
trograms before converting them to 
sound with the Pattern Playback.® 
The control stimuli were prepared in 
ABX triads accordin: 
cedure followed for 
and were presented 


times to the same listeners employed 
for the labeling and discrimination 
phases of the experiment. The dis- 
crimination functions that were ob- 
tained for the control stimuli did not 
depart appreciably from chance, they 
did not show any regularly occurring 
peaks, and they lay well below the 


Speech-discrimination functions in all 
comparisons, 


g to the same pro- 
the speech stimuli 
a like number of 


The authors conclude that, to the 
extent that the control stimuli pro- 
vided a valid measure of the discrimi- 
nability of the Speech stimuli prior to 
language learning, the Speech-discrimi- 


eflect acquired dis- 


poral order of tw, 
fering frequency, 
time of onset to give 
ment was about 17 


milliseconds. In 
comparison, the diffe 


Tence in time of 


®Certain additional modifications were 
made in the control patterns "to provide a 
further precaution against the possibility that 
the control stimuli would resemble speech 
[Liberman, Harris, Kinney, & Lane, 1961, 
p. 381]." 
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onset of the first and second (pem 
of the synthetic speech stimuli tha! à oho 
75% correct discrimination was k in 
12 milliseconds. Thus, the pea Bon 
the /do/—/to/ discrimination bs A 
again may be considered to refle be 
quired distinctiveness. Fel | 
Moore (1962) obtained a E fto] dis 
control base line for the / do/ _/ ү an 
crimination by measuring, ibility of 
aphasic subject, the discriminabi xe 
the same set of speech stimuli emp Дея 
by Liberman, Harris, Kinney : 21 imina- 
(1961). Chance levels of es ere 
tion (and a flat labeling gradien train- 
Observed prior to isenimin SE of 
ing. Following training, a uec 
Phoneme identification was o well 
and ABX discrimination was -step 
above chance levels in all Eo 
comparisons, peaking at the ph 
boundary, ; 12 
In "А experiment ph у 
Speech stimuli that ranged tna} 
from /rebid/ to /reptd/ (I Bastian, 
Harris, Eimas, Lisker, & ved in 
1961), the stimuli were pte in 
two- through nine-step compar? func- 
ABX triads. The discrimination hown 
tions obtained from 12 депе) б 
by the filled circles in Figure ey (eh 
a peak at the phoneme gna well 
igure 4) and, in general, eur (un- 
With those functions predicte gradi- 
filled circles) from the labeling. Eon 
ents. The obtained discre d 
functions lie, at all points, т that 
above those predicted, suggest E veen 
the listener can discriminate e often 
two speech stimuli slightly a itely. 
than he can identify them abso anctions 
The speech-discrimination Tonetti 
shown in Figure 6 are not, k in the 
unimodal. There is one pear spond- 
region of 70 milliseconds cot hase in 
ing to the phoneme boundary k in the 
igure 4 and a second gerd t does 
Tegion of 100 milliseconds boundary: 
not correspond to a phoneme bo 
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SILENT INTERVAL IN MSEC. 


апа 6. 


Obta i 


tion, "d f 


he ay 

A nam considered that the stimuli 
üteryaj - than 100-millisecond silent 
ү, ee by their listeners as 
that this . | unnatural.” They thought 
© heap) Phoneme “might nevertheless 
i almo and articulated by our listen- 
Deech Š St as if it were a different 

пішу [p. 186]." The listeners 


(T compara 5 
whe erus ресс stimuli (crosses), when presented in 
п they Were; unctions are based on the labeling gradients obtaine 
Se as aoa to identify each stimulus as /b/, /p/, or */p/- 
each triad hows the mean relative frequency of correc! 
. From Liberman, Harris, Eimas, Lisker, 


Discriminati s 
crimination of the synthetic speech stimuli /rabid/-/repid/ (filled circles) 


ABX triads to 12 listeners. 
d from these subjects 
Each point on the 
t discrimination in 40 presenta- 
& Bastian, 1961.) 


were then recalled and instructed to 
label the speech stimuli anew, this time 
with three possible responses : /Ъ/, [PI 
апа */p/. The labeling function for 
ibjects then showed a 
third category, */p/, and hence a 
second “phoneme boundary": /р/— 
*/р/. The predicted discrimination 


the group of su 
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functions shown in Figure 6 were de- 
rived from this revised labeling gradi- 
ent. 

In order to assess the degree of ac- 
quired distinctiveness shown by the 
speech-discrimination functions, con- 
trol stimuli were prepared by processe 
ing the speech stimuli so as to yield 
noise bursts with the same amplitude 
envelope and the corresponding silent 
intervals. These control stimuli were 
arranged and presented for discrimina- 
tion measurement in the same manner 
as the speech stimuli. The discrimina- 
tion of the control stimuli proved in- 
ferior to that obtained with the speech 
stimuli (filled circles) in almost all 
comparisons, as shown in Figure 6. 
“The entire learning effect consists of 
a sharpening of discrimination in the 
vicinity of the phoneme boundary [p. 
189].” This conclusion agrees with 
that reached earlier in the experiment 
on discrimination of the phonemes 
/d/-/t/. 

In the experiment employing speech 
stimuli that ranged perceptually from 
/sht/ to /split/, reported by Bastian 
et al. (1961), ABX discrimination was 
also measured and found to show peal 
in the region of the phoneme boundary 
and, over all, a close corespondence to 
the discrimination functions predicted 
from the phoneme-labeling gradients, 
The acquired distinctiveness of the 


/sht/-/spht/ stimuli was not deter- 
mined. 


ks 


The following kinds of evidence, viewed 
by the experimenters as favoring the motor 
theory of speech perception, have been cited 
in this section: When the frequency locus 
of a schematic stop was held constant and 
the following vowel varied, consonant iden- 
tification varied in a way that seemed to 
accord with the varied patterns of articula- 
tion necessary to produce the comparable 
speech stimuli in natural language. When 
synthetic speech stimuli were selected from 
an acoustic continuum that ranged percep- 
tually from one phoneme to another and 
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" for 
these stimuli were presented epe dod 
identification, the phoneme-labeling £T ENS 
were quite steep, suggesting сав ted by the 
ception of the speech stimuli, media uired to 
discretely different articulations Же lan- 
produce the two phonemes in sen stimuli 
guage. When the synthetic ерес were 
employed for the labeling experim imination 
arranged in ABX triads for e dis- 
measurement, listeners more Te Geer op- 
criminated between stimuli саип than 
posite sides of the phoneme pr 
between stimuli drawn „from aise 
phoneme category, suggesting that involve 
different mediating responses were hancing 
in the former discriminations, C'tifferent 
discriminability by their very imination 
proprioceptive returns. These discri redicte 
functions accorded well with those P imption 
from the labeling gradients on the ass speech 
that the listener could discriminate labeled 
stimuli only to the degree that discrimina- 
them differently, suggesting that пош 
tion was controlled, not by the con b 
varying speech stimuli, but rather ЭУ, 
proprioceptive return from only tw 
ent mediating responses. btaine 
base line of discriminability was o! ropriate 
the speech stimuli by presenting арр 
control stimuli for discrimination 5 
ment, the speech discriminations We 
to be much more accurate in all ар speech 
Suggesting acquired distinctiveness yediating 
stimuli, brought about by the Е 
articulations that these stimuli evoke. 


е foun 
arisons: 


; . "mw Per- 
Diverse Acoustic Stimuli, ewe 
ceptual and Articulatory Respo’ 


1687 

Liberman (1957) has put the q" 1 
tion: “When articulation and 
wave go their separate ways, 
Way does the perception go? [P- ^ 


à theory 
His answer, favoring the UM per 
ОЁ speech perception, was: culation 


ception always goes with art! dud 
[р. 121].” Part of the supper ori 
this answer is provided by petes P 
ments discussed above, in W similar 
verse articulations produced diverse 
acoustic stimuli that evoked al 51р” 
perceptual responses. Ада iy the 
port for the answer is provide sed е" 
complementary findings, peret ie 
low: Similar articulations pro 
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Fic. 7. The spectrographic patterns that produce /d/ and /g/ before various vowels. 


(The dashed lines are extrapolations to the /d/ and /g/ loci. 


verse acoustic stimuli that evoke simi- 
lar perceptual responses. 
í The first evidence of this sort comes 
ш a study of the perception of the 
Ор consonants discussed earlier 
{Liberman et al, 1952). When the 
BT locus of a schematic stop 
te eni formant frequencies of a fol- 
кз ng vowel were both allowed to 
mt greatest number of /k/ iden- 
A. lons was not associated with any 
de Schematic stop, but varied widely, 
arn on the following vowel; for 
не а schematic stop at 2,520 
jud S per second gave the most /k/ 
* aerae if the following vowel was 
mu Ki one at 710 cycles per second 
“followin: most /k/ judgments if the 
Shake ng vowel was /u/. The nearest 
Asa equivalents to the schematic 
C due all produced by similar artic- 
of th y movements (although the point 
бр be tongue tends to be slightly back 
ome vowels and more forward for 
Тра ОЕ “We may say then, in 
that t to schematic stops heard as k, 
Sdn ne perception reflects the basic 
тае rity in the articulatory patterns, 
ei bi the relatively gross differ- 
S14] » ОЕ the acoustic stimuli [p- 


то 


abl The anomalous status of /k/ is presum- 


Уо Бе explained оп the basis of some dis- 


From Liberman, 1957.) 


Additional evidence seen as favoring 
the motor theory of speech perception 
is provided by experiments on the per- 
ception of the stop consonant /g/ in 
initial position. Liberman, Delattre, 
Cooper, and Gerstman (1954) have 
shown that the direction and degree of 
second-formant transition serves as a 
cue for the perception of the stop con- 
sonants, and that the degree or rate of 
transition necessary for modal iden- 
tification of a given stop consonant in 
a CV pair depends on the vowel that 
follows it, as shown in Figure va 
Therefore, the perception of the con- 
sonant remains invariant from one CV 
pair to the next, despite large changes 
in the formant transitions. With re- 
spect to the stop consonant /d/, the 
investigators were able to associate the 
perceptual invariance with an acoustic 
invariance: Different /d/ transitions 
before different vowels "seem to be 


coming from the same frequency posi- 


tion—namely the /d/ locus at 1800 cps 
In the 


[Liberman, 1957, p. 121]." 


not at the articulatory level, but 
at the point of conversion of the articulatory 
ure to acoustic output. It would appear 
acoustic description provides a 
poorer approximation to the ideal phoneme 
than does the articulatory characterization, 
at least for /k,g/ [Lisker, Cooper, & Liber- 


man, 1962, p. 94]." 


continuity, 


gesti 
that the 
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case of /d/, therefore, perception, p^ 
ticulation, and the acoustic stimulus " 
“go” the same way. In the case o 
/g/, however, no acoustic invariance 
was found to be associated with the 
invariant perception (or with the simi- 
lar articulations that produce /g/ be- 
fore the several vowels). Figure 7 
shows that this consonant requires a 
large transition originating at about 
3,000 cycles per second when the fol- 
lowing vowel is /i/, /e/, /e/, or /a/. 
When the following vowel is /2. /, /о/, 
or /u/, however, the transition is, on 


the contrary, very small, and the locus 
has shifted markedly. 


Here, the large and abrupt change 
acoustic pattern contrasts wi 


tively small and continuous 
ноп 


іп 
th the compara- 
shifts in articula- 
. The identity of the perceived 
consonant remains the same throughout, and 
thus parallels the invariance of the articula- 


tion [Cooper, Liberman, Harris, & Grubb, 
1958, p. 937]. 


Perhaps the most striking invariance 
in the perception of speech under wide 
changes in the acoustic stimulus is the 
constancy of vowel identification de- 
spite changes in the vowel-formant fre- 


quencies, due to differences am 
Speakers, dialects, 
vironments. 


ong 
and phonetic en- 
Tf each vowel phone is 
Tepresented as a point on a vowel- 


formant chart, whose coordinates are 
the first- and Second-formant frequen- 
cies of the vowels, and if the set of 
points Corresponding to each phoneme 
is considered to define a vowel region, 
then the regions of different. vowels 
sometimes overlap, and the regions of 
the same vowels Sometimes fail to over- 
lap. Even under optimal conditions, 
when vowels are uttered in neutral 
phonetic contexts by practiced speakers 
and only those vowels are analyzed that 
are identified unanimously and con- 
sistently by a group of listeners, even 
then, the assignment of a vowel phone 
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to a phoneme class is ambiguous, a$ 

vn in Figure 8. Е 
E order a account for the im d 
tual constancy of the vowel pedis 
despite wide variations in its a E 
properties, Joos (1948) put Br. ve 
a classic monograph, a motor th Pd 
of vowel perception. Joos notes, 248 
of all, that it is possible to find aly 
order among the vowel regions 50 
on the basis of their acoustic рат e 
eters. This ordering may be — 
graphically as a vowel triangle Led 
posed on the formant plot; its rye үз 
are the vowel regions /i/, /а/, an ы 
When such a reference system 1S d 
for the vowels of a particular spea 5 
most of the other vowels he гре 
lie, ёо а degree, on the borders ssl 
within the triangle and in ae 
predictable positions. Тһе е 
that is fit to the utterances of a раг E 
lar speaker, however, differs from that 
Speaker to the next. Joos notes a 
the between-speakers variance in d by 
regions may be greatly reduce es 
"sliding" the several vowel Безе E 
into congruency, and he descri ite 
statistical procedure for HUE т ге 
this. Joos suggests that this pue 
is a model of the process by № ei 
Phonemic constancy is achieved in NT 
ception. He contends, наре 
that this normalizing process in V ihe 
Perception is controlled, not bY their 
speech signals themselves, but by 
articulatory correlates: 


" of 
The brain, then, contains an equivalent о! 
ig. 29 [a schematic of the оён Hy 
articulation] and upon this map it C yowe 
own an equivalent of Fig. 26 [a height 
quadrilateral with the coordinates: during 
and position of the tongue dump, Jatter 
articulation], and if it has two of the 


a t the 
it can shift and distort one of them to fi 
other [p. 63]. 

_——_ 


r is used 
5 Sometimes a vowel quadrilateral ts Dos 
as a model. This does not chang 
(1948) theory substantially. 
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Fig, 8 T 
- The frequency of the second formant versus the frequency of the first formant for 
10 vowels by 76 speakers. (From Peterson & Barney, 1952.) 


ES suggests that the brain may de- 
the ED vowel quadrilaterals much in 
Site nner now used in certain auto- 

: Speech-recognition procedures. 
dar" trips says, "How do you 
m а listener registers the first 
on А me as this speaker's articu- 
high ia central, the last phone as 
cm ou, and the median /у/ would 
em’s c mine high front. “The pat- 
other ag are now located, and 
Spaced” hones can be assumed to be 
нав to them as they are 
61].» y spaced in this dialect [p. 


T! 

M following kinds of evidence, viewed 
y eeepc as favoring the motor 
this se speech perception, have been cited 
а schem ction: When the frequency locus of 
frequen atic stop consonant and the formant 
allowed t of the following vowel were 
Cation O vary, the same consonant identifi- 

Was evoked by diverse acoustic pat- 


theor: 


terns, suggesting mediation by the similar 
articulatory responses required in natural 
language to produce the consonant before 
the several vowels. When a stop consonant 
was synthesized before each of a series of 
vowels, by varying the transition of the 
second formant preceding its steady state, 
the same consonant identification was evoked 
despite a discontinuity in the series of 
second-formant transitions, suggesting, once 
again, mediation by the similar articulatory 
responses required to produce the consonant 
before the several vowels in the series. 
Finally, when the formant frequencies of 
natural-language vowels vary, two distinct 
vowel-formant regions, corresponding to ut- 
terances of the same vowel by two different 
speakers, evoke the same vowel identifica- 
tion while two overlapping regions, corre- 
sponding to utterances of different vowels 
by different speakers, evoke different iden- 
tifications. These findings suggest that the 
listener normalizes the acoustic vowel system 
of each speaker in order to map it onto a 
fixed set of vowel identifications, and that 
this is accomplished in terms of the articu- 
latory correlates of the vowels rather than 
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in terms of their more diverse acoustic 
parameters. 


Changing Perception Correlated Less 
with the Acoustic Stimuli, More with 
the Patterns of Articulation that 
Produce those Stimuli 


The four experiments summarized 
below have in common that the acoustic 
variable was the sound pressure level 
of the utterance and the perceptual 
response was some estimate of loudness 
or stress. | 

The suggestion that the perception 
of a suprasegmental feature of language 
such as stress may involve mediating 
articulation was perhaps first made by 
Jones (1932) who wrote: 


Accent is sui generis depending for its 
perception on the kinesthetic Sense, . . , 
The listener refers wh 

would say it. 
tor into propri 
thetic memory 


at he hears to how he 
Thus he translates exterocep- 
Oceptor sensations, the kines- 
Serving as stimulus [p. 74]. 


In a study conducted by Ladefoged, 
Draper, and Whitteridge (1958), the 
sentence: “The old man doddered 
along the road” was uttered by one 
speaker while the action potentials of 
his internal intercostal muscles were 
measured. The authors report that 
listeners? estimates of the stress loci in 
the sentence corresponded to the tem- 
poral loci of heightened muscular ac- 
tivity. "They suggest [p. 9] that state. 
ments about stress “are usually best re- 
garded as statements about the speaker's 
muscular behavior (or about the ac- 
tions of the listener's muscles which 
would have to be made in order to 
produce similar sounds)” 

In a subsequent stud 
Ladefoged, and Whitteridge (1959), 
speakers were instructed to count to 
20, making the words equally loud, 
The authors report that the peak in- 
tensities of the words differed by as 
much as 15 decibels, “but subjects were 

not usually aware of these differences 


y by Draper, 
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[p. 21]." Although a speaker's ades 
level varied, the mean level of air p ase 
sure in his lungs, found in other deus 
urements to depend on "the loudn to 
with which the speaker was pe a 
talk," remained "fairly constant" un ual 
instructions to count to 20 with that 
loudness. The authors conclude v 
“naive listeners, obeying an instruc “is 
to consider the loudness of sounat 
continuous speech, do not ie but 
acoustic properties of the soun S uidi 
consider, instead, the pressure pes 
would be required below the v 
cords [p. 21]." КР В 
The pda Siti controlled pd 
ment whose findings are seen as coru 
ing the inference of mediating ar was 
tion in the perception of poe 
performed by Ladefoged and Mc же 
(1963). The five words: anh s By 
BAY, BORE, and Boo were Oe pees 
one speaker at various vocal леа8- 
under conditions that permitted eu 
urement of the peak sound nec 
and peak subglottal air pressure pee 
the voicing of each utterance. 2nd 
utterances were tape-recorded, 4 Я 
each was spliced into Ње ig 5 
carrier phrase: “Compare the v ners 
BAR and. . , ” Thirty win 
were presented with 65 peii jects 
structed in this way. The iom 
were instructed to consider the | the 
stant) loudness of the word BAR Pago” 
carrier to have a magnitude O f the 
and to state the relative loudness amber 
terminal word by assigning it a n as its 
in the same proportion to 10 ВАК. 
loudness was to the standard, yates 
The means of the loudness estin the 
(L) were found to be related eat 
Sound pressure level (S) by a Fo). 
function with exponent 1.2 (Le ;ords 
The loudness estimates of the “(W) 
may also be related to the work the 
done upon the air in 8 
Voiced sounds. This work is рона! 
tional to the product of the subg 
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чш and the rate of flow of air 
hrough the glottis (W«P-F). In 
an earlier experiment of this series, the 
authors showed that Pa«F (hence 
Үр) and that P«S* The last 
re expressions give W œ S**. Since 
е 512, the loudness estimates of the 
qw Stimuli were linearly related to 
м done in producing them. The 
a a conclude: “It would appear 
whit пе perceived loudness of words 
Fat es within tbe normal speech 
n ge is largely dependent оп... the 
Physiological effort required to produce 

them [p. 459]." 
Nes Catania, and Stevens (1961) 
ae reported that the speaker's esti- 
the he his own vocal level grew as 
fired power of the sound pressure he 
ies bo As Ladefoged and McKin- 
that g 63) point out, their finding 
EE ae ? suggests that the Lane 
TER 961) speakers were judging 
Ке ogical effort. This hypothesis 
‘eee by the finding that the 
Дап Phonic scale," ye S*!, was not 
ж р when the speaker was de- 
fad of auditory feedback by intense 
генны. noise. Ladefoged апі Mc- 
Ке | (1963) conclude that the loud- 
ated b a speech sound, whether gener- 
ела the listener himself or by an 
of he а Da perceived in terms 
Бейбаш ological effort required to 

t е that sound. 

Speech M^ study of the perception of 
t pe панаа to be cited іп support 
Was ea or theory of speech perception 
(1959 Ported by Lehiste and Peterson 
Pared ^ A tape recording was pre- 
Need in which various vowels pro- 
Made: equal effort were mixed at 
Мемо with vowels produced with 
ranged effort. The vowels were ar- 
Senteq in pairs at random and pre- 
list. at the same sound pressure level 
oude eners instructed to judge the 
“ ШИ of the two vowels in the pair. 
st invariably, the listeners iden- 
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tified the vowels that were produced 
with a greater amount of effort as 
louder than vowels having greater in- 
trinsic amplitude but produced with 
normal effort [p. 431].” In the light 
of these findings, the authors suggest 
that: 


speech according 
speech production 


The listener interprets 
to the properties of the 
mechanism rather than according to the 
psychophysical principles of the perception 
of abstract sounds [p. 428]. 


The following kinds of evidence, viewed 
by the experimenters as favoring the motor 
theory of speech perception, have been cited 
in this section: When listeners judge the 
stress loci in a sentence, these judgments 
correlate most closely with temporal loci of 
heightened muscular activity in the speaker, 
suggesting that the listener is guided by 
covert articulatory cues in making his esti- 
mates. When speakers are instructed to 
count, making the words equally loud, large 
changes in intensity are produced although 
the mean level of air pressure in the lungs 
remains constant, suggesting that the speaker's 
criteria for equal loudness depends on proprio- 
ceptive cues from muscles involved in articu- 
lation. When listeners estimated the loudness 
of a word uttered at various levels, their 
estimates were linearly related to the work 
done in uttering the variable word; these 
estimates were also related linearly to the 
autophonic scale which describes the speak- 
er's perception of his own vocal level. 
Inference from these findings suggests that 
the listener judges the loudness of speech 
sounds in terms of the effort cues associated 
with their production. Finally, when pairs 
of vowels uttered with unequal effort were 
presented at equal sound pressure levels and 
listeners judged the louder vowel in each 
pair, the vowels produced with greater effort 
were always judged louder, suggesting that 

f loudness is more closely 


the perception о! : ү 1 
correlated with changes in articulation than 


with the acoustic stimulus. 


EVIDENCE THAT OPPOSES THE MOTOR 
THEORY OF SPEECH PERCEPTION 


The inference that speech is per- 
ceived by reference to articulation was 
arrived at, in the studies discussed pre- 
after examining principally 


viously, i › I 
endent variables : identifica- 


three dep 
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tion, discrimination, and loudness-esti- 
mation functions. The properties of 
these functions that were repeatedly 
cited by the investigators as favoring a 
motor theory of speech perception 
were: the steepness of the identification 
gradients, the correlated peaks in the 
discrimination functions, the accuracy 
with which the discrimination functions 
may be predicted from the labeling 
gradients, and the correlation of loud. 
ness estimates with vocal-effort meas- 
urements. 

This section of the review considers 
some methodological limitations of the 
studies cited above and presents evi- 
dence that the reported relations be- 
tween identification and discrimination 
functions are not at all unique to the 
perception of consonants but describe 
as well the perception of vowels and 
of entirely nonlinguistic stimuli such as 
complex tones, sectors of circles, and 
patches of color, Clearly, these find- 
ings with nonlinguistic stimuli militate 
against the postulation of a special per- 
ceptual mechanism for speech. The 
same findings for the vowel stimuli also 
contravene the motor theory for this 
reason: The theory maintains that the 
Perception of most Consonants is dis. 
continuous or categorical because they 
covertly evoke discontinuous articula- 


tions whose discretely diff 


erent Sensory 
feedback controls the overt identifica- 
tions, whereas the Perception of vowels 


is continuous because the mediating 
articulations are continuous and pro- 
vide similar Sensory returns, 

Finally, this section shows that the 
sensory dynamics of listening and 
speaking are drastically different, dis- 
puting the purportedly close relation 
between the perception of loudness and 
of vocal effort. 
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Identification Functions for Consonants, 
Vowels, and Nonspeech Stimuli 

The following considerations pre 
clude construing identification е 
tions as providing support for a moto: 
theory of speech perception: 


1. The widespread practice of Lae 
selecting subjects and of apes 
their pooled data causes the mm ‘ 
the identification functions associate 
with a stimulus continuum to vary 
arbitrarily, E rds 

2. It is not possible in principle ka 
compare the steepness of vowel е 
consonant labeling gradients in ped 
nonarbitrary way since their absciss 
are noncommensurate, bel- 

3. A direct comparison of the la ws 
ing gradient for a consonant continuu 3 
with that for a commensurable pem 
linguistic continuum reveals that 
gradients are comparably steep. 


Subjects were selected for pe 
tion in several of the experiments : p 
ported above depending on the #5 a 
ness of their labeling gradients, e 
measured on a pretest. For d 
in the experiment employing the Р Ss 
пете contrast /do/—/to/, “the 1 ats 
[from a total of 20] whose judgme s 
Provided the sharpest phoneme ne 
daries were selected for the experime 

iberman, Delattre, & Cooper, 19 z 
Р. 382]”; in the /b/-/d/-/g/ expe I 
ment, “two of the five Ss in eed to 
Were eliminated because they E tly 
apply phoneme labels Ee cae 

Liberman et al., 1957, p. 360] f d 
in the /b/-/p/ experiment, а а 
were 12 subjects in all, chosen fron st 
group of 31 on the basis of a prete to 

he purpose of the selection wie a 
insure that all subjects ultimately * a 
ing in the experiment would ixi M 
Sharp and clear phoneme Doun ý 

iberman, Harris, Eimas, a i 

astian, 1961, p. 181; italics minel- 
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The rejection of 50% of the subject 
population in these three studies ac- 
cording to a criterion that was neither 
explicit, quantitative, nor uniform 
greatly restricts the generality of in- 
ferences based on the findings obtained 
with the remaining subjects. This 
Procedure of preselecting subjects viti- 
ates, moreover, comparisons among 
different experiments with respect to 
the form of the obtained labeling gradi- 
ents, the predicted discrimination func- 
tions which are derived from these 
adients, and, therefore, the disparity 
etween predicted and obtained dis- 
pu, neien functions. As a result, it 
an not been possible to assess the 
that Ex of an important prediction 
аре nape from the motor theory of 
a €ch perception; namely, that per- 
Ption of the consonants is more cate- 
Borical than is perception of the vowels. 
un p procedure of averaging individ- 
нои entification and discrimination 
Eas and presenting group data, 
studi yed in all the synthetic speech 
^ les cited earlier but two, may be 
үи: alongside preselection of sub- 
е таш сотрагіѕопѕ between 
ok ndings of experiments when these 
Eu зов are based directly or indi- 
im = later) on the labeling gradi- 
the b $ ince it is generally true that 
Varies шу of the labeling gradient 
ile lo rom one subject to the next in 
A s along the stimulus continuum, 
Ss улке gradient is considerably 
a gi than the individual gradients 
Н reflects none of them well. Put 
eae terms, this procedure con- 
Which the within-subjects variance, 
gorii] tes to the judgment of cate- 
Subj cup Регсеріоп with the between- 

Jects variance, which does not. 
ents Parisons of identification gradi- 
Кунше difficulty even when 
lected ual gradients for randomly se- 
Whe Normal adults are considered. 
П two gradients, both functions of 


le 
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the same synthetic speech continuum, 
are compared, the judgment of cate- 
gorical perception may be based on the 
extent of the plateaus in response prob- 
ability or, equivalently, the steepness 
of the boundary between these pla- 
teaus. However, the steepness of a 
labeling gradient or, derivatively, the 
height of the predicted discrimination 
function for the same stimulus con- 
tinuum, is of interest largely in rela- 
tion to the steepness of other gradients 
collected with other stimulus continua : 
If the steepness of consonant-labeling 
gradients or the height of discrimina- 
tion functions derived from them is 
cited as evidence of discretely different 
mediating responses, then it must be 
shown that these gradients are more 
steep than those obtained in comparable 
experiments with vowels, whose articu- 
lations are topographically continuous, 
and certainly more steep than those ob- 
tained with nonspeech stimuli. 

When comparing labeling gradients 
for different stimulus continua, one 
gradient may not be called more cate- 
gorical than the other solely because 
it has more extended regions over 
which response probability is zero or 
unity; the extent of these plateaus in 
the labeling gradient is largely a matter 
of experimental convenience. For ex- 
ample, in the experiment whose out- 
come is shown in Figure 4, the silent 
interval for the rabid-rapid continuum 
could have been allowed to vary only 
over the range 40 to 90 milliseconds. 
The outer boundaries of the phoneme 
classes have not been defined experi- 
mentally, so it is not possible to say to 
what extent the gradients “partition” 
the total perceptual range. 

It appears that the sole property of 
diverse gradients that will reliably 
serve as a basis for judgments of steep- 
ness is the value of their slope over the 
range in which it is not zero. How- 
ever, the slopes of the diverse gradients 
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PER CENT IDENTIFICATION 
n 
© 


m 
© 


100 4 NONSPEECH | 
so|- 
s, 
ol- 


s2 
20 40 60 
F, CUTBACK IN MSEC 


Fic. 9. Identification of nonspeech stimuli. 
(The seven spectrographic Patterns in the 
Series employed by Liberman, Harris, Kin- 
ney, & Lane, 1961, to study /do/-/to/ identifi- 
cation were inverted and then converted to 
sound. Each of three subjects was condi- 
tioned to respond po to the stimulus with no 
cutback of the first formant and TO to the 
stimulus with 60-microseconds cutback. In 
generalization testing, each of the seven 


stimuli was presented 60 times, in irregular 
order.) 


(or the entire functions for that mat- 


ter) are obviously not comparable— 
despite the widespread practice of com- 
Paring them—since their abscissae are 
not commensurate” It may still be 
Possible to compare diverse gradients 
if some normalizing procedure is found 
that is not Sensitive to arbitrary differ- 
ences between the experiments, such as 
the number and Spacing of the stimuli 
employed. That there exists а Satis- 


dure may be stro 


There is considerable 


evidence to 
support the conclusion tha 


t the general 
э As an explicit example of these 


improper 
comparisons: “Identification of the con. 
sonants was marked by abrupt transitions 


.. Vowel perception was, 
tinuous: the identification fu 
less steep boundaries . 
206]." 


however, con- 
nctions showed 
- . [Eimas, 1963, p. 
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form of labeling functions for ud 
sonant continua does not differ cone 
that for vowel and nonlinguistic and 
tinua. Clearly defined gum, in 
boundaries have been me 
vowel identification by —— о 
nedy, Liberman, and Stevens ( man 
Stevens, Ohman, and Liber e 
ee Paul aad dignum 
(1962) ; and Fry, Abramson, 5 dm 
and Liberman (1962), among ү nd 
while these properties have been fo ane 
with identification functions for pu 
linguistic continua by Cross, ш 
Sheppard (1965); Lane and tus 
(1964); Beare (1963); and 
and Lane (1962), among others. ie 
There is also evidence that the for 
tailed form of labeling gradients dà 
consonant and nonlinguistic ponens a 
not differ appreciably, Such ws oral 
Sons are appropriate only if the 5 pel 
lus variable is the same in the ider 
Cases. Therefore, Lane and eR ng 
(1963) selected the published la entar 
gradients for /do/—-/to/ as у ann 
tive gradients for consonant con eg 
and then employed the /do/-/to/ adie 
trol stimuli to obtain comparable gr 
ents for a nonlinguistic continuum. dis- 
will be recalled from the earlier by 
cussion of the /do/—/to/ р Ned 
Liberman, Harris, Kinney, and ere 
(1961) that the control stimuli у; 
obtained by inverting the /do/ p 
Spectrograms before converting um 
to sound (an operation which Ше ше 
the stimulus variable), that they pes 
Not perceived as speech и 
that their discrimination func асу. 
showed only chance levels of ассит this 
€ two extreme stimuli mum о 
Series (with 0 and 60 millisecon re- 
cutback in F1, respectively) eds E o 
Sented in random order ќо, 626 ally. 
Sight listeners serving individua a 
The subject was seated in ao was 
tape recorder and counter an enter. 
visually isolated from the experim 
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RELATIVE FREQUENCY 
A Pa í 
О 


0-9 


m SPEECH STIMULI zl 
О NONSPEECH CONTROL 


T— —t T T 


П П П 


10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-100 


PER CENT DIFFERENCE IN LABELLING 


F А 
of ee 10. Comparison of the degree of “categorical perception 
hetic speech stimuli (Liberman, Harris, Kinney, & Lane, 1961) 


” obtained in identification 
and comparable non- 


Speech sti . : 
stimuli (Lane & Schneider, 1963). (For each experiment, each stimulus was com- 


раге 
di 
ifferences was plotted. 


nonz, R 
ero only in the extreme classes.) 


у told to say /do/ or /to/ in 
that h -second interstimulus interval, 
edis pud: receive a point on the 
and zwi or each correct identification, 
at the each point was worth a penny 
experi end of the experiment. The 
Presented the arama by E protocol, 
each С the point immediately after 
Was Sapete response. Conditioning 
Sone alter either 30 correct 
Sn cations in succession or 400 
eight S T air voie Three of the 
one jects who participated met the 
individ criterion and were presented 
e&t se ually with the generalization- 
Bie: К. in which the seven control 
With ые in an irregular order, 
follow е constraint that each stimulus 
Е ed every other exactly twice. In 
Second through fourth daily experi- 


to every other in percent identification, 
In the limiting case of “categorical perc 


&radi " 
ients would have the form of a step function, and the frequency 


and the relative frequency of within-pairs 
eption,” the labeling 
distribution would be 


mental sessions, conditioning was re- 
duced to 100 stimulus presentations, 
and generalization testing was repli- 
cated. 

The generalization gradient for the 
nonspeech (control) stimuli, obtained 
from each of the three subjects, is 
shown in Figure 9. These gradients 
are directly comparable to the /do/- 
/to/ labeling gradients reported for 13 
subjects by Liberman, Harris, Kinney, 
and Lane (1961). This comparison 
reveals that the steepness of nonspeech 
gradients may equal or exceed that for 
speech gradients. One way of combin- 
ing the individual gradients in each of 
these two studies that is not sensitive 
to differences in the loci of the gradi- 
ents on the stimulus continuum is 
shown in Figure 10. To illustrate how 
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the functions shown there were A 
tained, the procedure employed wit! d 
the individual gradients in Figure 
will be described. For each subject, 
the relative frequency with which the 
/do/ response was emitted to each of 
the seven nonspeech stimuli was noted 
as a percentage. Each Stimulus was 
compared to every other in the percent 
of /do/ responses it evoked, and the 
difference in percent was obtained for 
each pair. A frequency distribution of 
these percent differences was prepared, 
with class intervals of 10%, and the 
frequency distributions for the three 
subjects were combined. The fre- 
quency measures were then converted 
to relative frequency and plotted in 
Figure 10. If the individual gradients 
were step functions, the limiting case 
of “categorical perception,” then all the 
percent differences would be either 0 or 
100, and the frequency distributions 
would have peaks in the extreme 
classes leaving intermediate classes 
empty. 
Comparison of the frequency distri- 
butions in Figure 10 for identification 
of synthetic speech stimuli 


Harris, Kinney, & Lane, 1961) and 


the claim that 
ech stimuli is 


Discrimination Fun 


clions for Conso- 
nants, Vowels, and 


Nonspeech Stimuli 
The relations betwe 
functions obtained for 


continua, those predicted from the cor- 
responding identification gradients, and 
those obtained with comparable non- 
speech control stimuli force an extra 
degree of complexity on the motor 
theory of speech Perception, which 
must introduce a direct route for speech 
discriminations in order to account for 
the relations among these functions, 


€n discrimination 
synthetic speech 
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Concerning the first relationship, p 
trated in Figure 6, the obtained za 
crimination functions lie above ү 
predicted from the identification S net 
ents. This finding has been interpre in 
as showing that "listeners are — 
discriminating these stimuli to ex * Я 
some information in addition to b 
Which is revealed by the way in yara 
they label the stimuli as phonem & 
Liberman, Harris, Eimas, Lisker, * 
Bastian, 1961, p. 185]." This inter- 
pretation implies that the cu dere d 
Speech discriminations are contro реч 
by mediating articulations and by = 
acoustic stimulus, either alternately be 
concurrently, and in some relativ 
degree. ; ; 
Concerning the second севар 
also illustrated in Figure 6, the m 
discrimination functions lie abowe tor 
for the nonspeech control M or 
which are purportedly comparable, | iin 
at the phoneme boundary and wit ah 
phoneme Categories. If еа, 
were relying exclusively оп articulato =a 
cues in making the speech legen 
tions and if the articulations were ks 2 
categorical (ie, identical within Pi 
neme classes), then the speech- Hn 
crimination functions should fall, je > 
phoneme classes, at the level of or a 
low those functions for the. o 
stimuli, That they do not again for E. 
the interpretation. i we are to REA 
serve the mediation hypothesis—th ^ 
the listener Sometimes utilizes ptam 
Ceptive information and sometimes d a 
Dot. It is unclear by what means ee 
istener manages to use this нале 
tion just when it has the most ^ 
offer, at the phoneme boundary, and 1 
ignore this information when it Жош 
detract from discrimination, within 
Phoneme Classes, 
Consideration of the dep 
among discrimination functions | 
consonants, vowels, and чевер ол. 
stimuli provides further evidence tha 
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militates against the motor theory of 
speech perception. Concerning the 
first comparison, it has been found that, 
although consonants are generated in 
Natural language by discontinuous ar- 
ticulations and vowels are generated by 
Continuous articulations, enhancement 
of discrimination at the phoneme boun- 
dary occurs with both consonant and 
m stimuli: This finding contradicts 
a outcome predicted by the motor 
Пеогу of speech perception: 


When significant acoustic cues that occupy 
EE Tent positions along a single continuum 
Mni pe by essentially discontinuous 
m ations (as, for example, in the case 
Second-formant transitions produced for 

a by a movement of the lips and for /d/ 
ty tt of the tongue), the percep- 
and ате discontinuous (i.e. categorical), 
shoe iscrimination peaks develop at the 
hand me boundary. When, on the other 
Sent ka cues are produced by move- 
articulat at vary continuously from one 
exampl. Ory position to another (as, for 
second 4 the frequency positions of first and 
EEr produced by various vowel 
continu К? perception tends to change 
Phoneme У and there are no peaks at the 
ima е boundaries [Liberman, Harris, 

5, Lisker, & Bastian, 1961, p. 177]. 


Tmploying synthetic vowel stimuli, 
reme Stevens, and Ohman (1963) 
reas ound, on an identification test, 
eae sharp and well-defined 
воне aries between phoneme cate- 
© Mes,” and, on a discrimination test, 
co at the phoneme boundaries, ie, 
in th mination between adjacent stimuli 
А г vowel continuum is better when 
ктш lie near phoneme boun- 
thetic [p. 13].” In the study of syn- 
реак Vowels by Fry et al. (1962), а 
dicted 1 vowel discrimination is рге- 
ound, in the region of the /e/-/æ/ 
Com ary for the two- and three-step 
Sa Unfortunately, this peak 
eee not be verified by the obtained 
ve, ination data since discrimination 
100 the entire continuum was nearly 
% correct in these comparisons. 
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There is, however, some indication of 
a peak in discrimination in the region 
of the boundary for the one-step com- 
parisons. To raise the ceiling on vowel 
discrimination in the latter study, Cross 
and Lane (1964) copied the tape re- 
cordings employed by Fry and co- 
workers, kindly provided by the 
Haskins Laboratories, while inserting 
a broad middle-formant intermediate 
to the two formant frequencies origi- 
nally comprising each vowel The 
series of degraded vowels was then 
presented to listeners for labeling and 
ABX discrimination. With the overall 
level of discrimination somewhat re- 
duced, marked peaks in the discrimina- 
tion functions were observed at the 
phoneme boundaries. 

It should be noted, however, that 
Bastian and Abramson (1962) did not 
find peaks in the discrimination of Thai 
vowels, minimally distinguished by 
duration. Whether this finding, which 
contrasts with those just reported and 
with the data for nonspeech stimuli 
that follows, should be attributed to 
some unique property of duration as 
a discriminative variable, or to a pro- 
cedural artifact such as near perfect 
discrimination over the entire contin- 
uum, cannot be determined from the 
abstract of the paper they presented. 

Concerning the second comparison, 
cited above, between consonants and 
nonspeech stimuli, it has been found 
that the latter also yield enhancement 
of discrimination at the boundaries of 
their identification gradients. To re- 
turn first to the experiment on dis- 
crimination of /reebtd/—/reprd/ (Lib- 
erman, Harris, Eimas, Lisker, & 
Bastian, 1961) described earlier, it will 
be remembered that a second peak was 
observed in the discrimination function, 
and that this peak was subsequently 
shown to correspond to a third “pho- 
neme category,” labeled */p/ ("the 
unnatural /p/"). The obtained peak 
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TABLE 1 


AVERAGE PERCENT CORRECT DISCRIMINATION 
or NONSPEECH STIMULI, PRESENTED IN 
ABX Triaps TO THREE SUBJECTS 


For A and | For A and 
B stimuli B stimuli 
1 drawn from | drawn from һ=1/ P 
Comparison | "the same | different | 2+ 
“phoneme “phoneme 
class"* classes''* 
% N 
One-step 49.8 239 
Two-step | 66.7 192 
'Three-step | 71.1 91 


In percent identification of A and B. 
дае тевсеіа ра less than 25% (same class) ог 
greater than 60% (different classes). 

** Normal test of differences 
Wadsworth and Bryan (1960). 


between percents, 
in discrimination at the /p/-*/p/ 
boundary is considered, according to 
the motor theory of speech perception, 
to reflect differentiated Proprioc 


eptive 
feedback from diverse articulatory pat- 
terns evoked by /p/ and */p/. It is 


difficult to imagine, however, what 
pattern of articulation has become at- 
tached to */p/ over long years of lan- 
guage conditioning, especially since the 
authors report that "this was a strange 
and unnatural /p/ to American ears 
[p. 186].” In this case, too, the hy- 
pothesis that Speech is perceived by 
reference to its articulatory correlates 
does not provide a convincing account 
of the observed relation between label- 
ing and discrimination, 
Three experiments employing ex. 
clusively nonlinguistic stimuli Provide 
the most convincing evidence that the 
observed relation between labeling and 
discrimination functions is not peculiar 
to the perception of Speech and is not 
predicated on "long experience with a 
language [during which] a speaker 
(and listener) learns to Connect speech 
sounds with their appropriate articula- 
tions | Liberman, Harris, Eimas, Lisker, 
& Bastian, 1961, p. 177]." Lane and 
Schneider (1963) conditioned two 
identification. responses to two non- 
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speech auditory patterns (the fdo- 
/to/ control stimuli) and then a 
ured generalization at intermediate 
stimulus values according to the pro- 
cedure described earlier. Prior By 
identification training in the first ех- 
perimental session, and after that п 
ing in all four sessions, each subien 
was presented the series of contro 
stimuli arranged in triads for ABX dis- 
crimination. Before identification re- 
sponses were conditioned to the 
extreme stimuli in the series, the ar 
crimination functions for the nonspeech 
stimuli showed chance levels of ac- 
curacy. Following training, these 
(ABX) discrimination functions 
showed peaks at the phoneme boundary 
in the one-, two-, and three-step com- 
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(control) sti 
triads to on 


relative freq 
in 


muli when presented in Ey d 
е listener. (Each point ‘ations 
uency of correct discrimina! uli 
Presentations of pairs of S eris 
Separated by two or three steps in the S ion 
of seven stimuli. This subject’s identi is 
gradient, which appears in Figure 9, ~ imi- 
also plotted to reveal that peaks in dd о! 
nation occur at the “phoneme boundary tion 
the Donspeech stimuli, The discrimina in 
functions Predicted from the identifica 

gradient are shown by the dotted lines.) 
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1.00 }- 


ad 


[eo] 


1.00 j- 


RELATIVE FREQUENCY OF /bab/ RESPONSE 


oor 


41.00 


RELATIVE FREQUENCY OF CORRECT АВХ DISCRIMINATION 


4 5 6 T 
STIMULUS NUMBER 


Fic. саи fscri 
12. Identification and discrimination of visual stimuli by individual subjects. 


4 5 6 Y 


(Each 


point on v odin Е 
the identification functions, filled circles, solid lines, shows the relative frequency of 


the 
he /b A b/ response in 24 presentations of t 


he corresponding stimulus. 


Each point on the 


Iscriminati i i 
eta ae i functions, open, circles, dashed lines, shows the relative frequency of correct 
ions in 36 presentations of each pair of stimuli.) 


= that the three listeners discrimi- 
rim: == readily between stimuli 
Pane Шз opposite sides of their 
om y 1) phoneme boundary," 
uli wer Figure 9, than between stim- 
Feed from within a “phoneme 
Басе The labeling and ABX 
Pie ns (solid lines) for the subject 
adn td the most orderly dis- 
п. ation data are shown іп Figure 


" кє al. (in press) also obtained 
ahd si between discrimination 
stimuli. eling | employing nonspeech 
hess eH this case visual patterns. 
of fo investigators conditioned each 
Wher subjects to respond /bAb/ 
o ү dms with deleted sectors of 42° 
Whe were presented and /gAg/ 
n those with sectors of 50° and 54° 


were presented. When this labeling 
repertoire was thoroughly established 
(50 consecutive correct identifications), 
the stimuli were presented in a per- 
muted order for identification and then 
again in triads for ABX discrimination. 
The outcome is shown in Figure 12: 
For all subjects the stimuli that evoked 
different labeling responses were more 
accurately discriminated than those 
that evoked the same labeling response. 

The correlation between labeling and 
discrimination has been observed with 
yet another nonlinguistic continuum : 
wavelength. Beare (1963) reports 
labeling gradients for spectral colors 
that intersect at 450, 490, 575, and 620 
my. Ekman (1963) obtained functions 
of similar form when he extracted five 
factors from a matrix of similarity rat- 
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ings obtained for spectral ыч and 
then plotted factor loadings as a ш 
tion of wavelength. The mS д 
both Beare (1963) апа Ekman ( “ 
show intersections of adjacent gradi- 
ents at wavelengths that turn out to 
correspond to minima in the functions 
relating the size of the jnd to wave- 
length (Jones, 1917). Thus, the dis- 
crimination of wavelength is most acute 
at the boundaries between color classes. 
Since the correlation between dis- 
crimination and labeling is observed 
for nonspeech as well as for speech 
stimuli, the correlation per se does not 
seem to warrant the postulation of a 
special perceptual mechanism for the 
discrimination of speech stimuli, 


Correspondence of Identification. and 
Discrimination Obtained with Conso- 
nants, Vowels, and Nonspeech Stimuli 

Nor does the degree of correspond- 


ence between labeling and discrimina- 
tion provide a basis for 


distinguishing 
among the perception of consonants, 
vowels, and nonspeech stimuli and 


therefore a basis for Postulating a spe- 
cial mechanism in the case of speech, 

It must be acknowledged that an ini- 
tial comparison of the obtained vowel- 
discrimination functions and those pre- 
dicted from the labeling gradients 
shows a poorer correspondence in the 
case of vowel stimuli (in the one set of 
findings published in detail) than in 
the case of consonant stimuli, as Fry 
et al. (1962) have noted, However, 
the apparent difference between the 
degree to which label; 


nation correspond in vowels versus 


consonants may be attributed to several 
concurrent artifacts : 


1. The predicted discrimination func- 
tion is derived from the averaged label- 
ing gradients of preselected Subjects, 
As discussed earlier, the criterion for 


preselection, based on “steepness,” has 


HARLAN LANE 


not been (and probably could not x 
comparable in experiments with зе s 
and consonants. Therefore, the w d 
of the average labeling godes e 
the predicted discrimination func p 
are also not comparable xc e. 
periments. Had Fry et al. (196 ) 17 
jected not 9 but 10, say, ont 
subjects, their predicted discrimina o 
functions for the vowels would a 
been higher; had Liberman, шы 
Eimas, Lisker, and Bastian (1961) 3 
tained not 12 but 13 of their 31 vd 
jects, their predicted functions for Ew 
consonants would have been lov "zd 
Since the experiments are oa of 
parable with respect to the heig ve 
the predicted discrimination func = Е 
they are not comparable with pose 
the correspondence of these functi 
to those obtained. ПИР 
2. The predicted discrimination p 
tion is based on three assumptions i 
cussed earlier, and each of tes а 
sumptions is violated by the pe 
mental conditions, Тһе first— P» 
speech stimuli are discriminated to tly 
extent that they are labeled dieren 
—has been disconfirmed by every hat 
periment. This is evidence of W E 
Was obvious beforehand, that Lap 
continua differ in their psychophysic? 


Е А ап 
Properties. Contrasting vowels an 
consonants psychophysically, we 

fo 


T example, that the relevant cue icd 
last more than 14 second in the pi^ - 
Case, less than М second in pa 
Moreover, vowels differ in their ter- 
trinsic” amplitudes (Lehiste & Pes 
Som 1959) by as much as 6 decibels 
Whereas the Stop consonants do die 
Ty and his collaborators chose to in 
low these amplitude inches tic 
their Tecordings of triads of synthe ce 
Speech stimuli, with the ce 
that—according to measurements of 
these tapes in our laboratory—45 % r- 
the triads could be discriminated СО 
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rectly on the basis of amplitude cues 
alone? These and other psychophys- 
ical cues tend to separate the obtained 
discrimination functions from those 
predicted in the case of the vowels but 
€ss so in the case of the consonants. 
Thus, the correspondence between la- 
beling and discrimination may differ 
mM these experiments for reasons 
at are irrelevant to a motor theory of 

Speech perception. 
E^ The second assumption on which 
"i — discrimination functions 
,based is also disconfirmed by ex- 
perneta] „evidence in the case of 
owel discrimination. This assumption 
ре Hu the phoneme classes to 
libel stimuli are assigned during the 
«Cn Mao are the same as those to 
e they are assigned during the dis- 
pese task. We presented the 
ean Eo series of synthetic vowels 
in ot sud et al. (1962) toa subject 
to ide aboratory with the instructions 
clin ntify each stimulus either by cir- 
он of the following words which 
Bii. di in a row on his answer sheet, 
eda P ek or by circling 
E between adjacent words if the 
e Ew to fall in a class between 
mw he result: nearly equal peaks 
р eae probability for the last four 
Sido е classes, with a series of sec- 
medi y peaks for HEED and the inter- 
ate classes, When the subject is 


10 
tensis met 82% of the triads showed in- 
Vowel changes correlated with those in 
Vowel quality, Since the difference limen for 
(Flana ntensity is approximately 1.5 decibels 
Cases Sr. 1955), we counted only those 
in à ү Which the correlated intensity change 
ted exceeded 1.5 decibels, yielding a 
in whi awe estimate of the number of triads 
the obi „intensity cues might have enhanced 
teseh 161 discriminations. More recent 
Vowels on the discrimination of synthetic 
tory Ste the Speech Transmission Labora- 
plit tockholm, which also allowed these 
timina fluctuations, found comparable dis- 
E ation functions (K. N. Stevens, per- 
Communication, January 1965). 


in 


am 
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not seriously constrained in his labeling 
responses, the number of labeling gra- 
dients is largely a function, of course, 
of the stimulus range, but also, secon- 
darily, of the naturalness of the syn- 
thetic stimuli which, in the experiment 
by Fry et al. (1962), was admittedly 
“adequate but no more [p. 178]." Had 
these investigators permitted their sub- 
jects a greater number of phoneme 
labels and then derived the predicted 
discrimination function accordingly, the 
disparity between the latter and the ob- 
tained function would be considerably 
reduced (the predicted functions would 
be elevated). The same may not be said 
for the consonant contrasts that have 
been investigated, however. Therefore, 
the artifact which we have been dis- 
cussing makes the results for con- 
sonants and vowels seem disparate. 

4. Finally, the predicted discrimina- 
tion functions for the vowels violate a 
third assumption of the prediction for- 
mula, “the various stimuli within each 
triad are perceived independently of 
each other [Liberman et al., 1957, p. 
363].” In the same article in which 
Fry and others remark on the disparity 
between obtained discrimination func- 
tions and those predicted on the as- 
sumption of independence, they present 
conclusive evidence of the noninde- 
pendence of vowel identifications within 
each triad. Based on a related study 
by Cross and Lane (1964), we may 
estimate that had Fry et al. (1962) 
acknowledged context effects in their 
predictive formula, the apparent dis- 
parity between predicted and obtained 
discrimination functions would have 
been reduced 10 to 15%. The finding 
of greater context effects in the percep- 
tion of vowels than in the perception of 
stop consonants is indeed significant, if 
its generality can be shown, but its rel- 
evance for a motor theory of speech 
perception remains to be stated and 
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demonstrated. Since the relational na- 
ture of vowel perception lies behind the 
apparent disparity _between predicted 
and obtained discrimination functions 
and therefore behind the tendency to 
view vowels as perceived continuously 
rather than categorically, Lisker et a]. 
(1962) had it just backward when they 
wrote: "We suspect that the continu- 
ous (as opposed to categorical) nature 
of vowel perception lies behind these 
examples of the tendency to perceive 
vowels relationally [Footnote 33]." 
Many of these obstacles, which block 
à direct comparison between consonant 
and vowel continua with respect to the 
correspondence of labeling 
ination, can be circumvent 
Parison of consonant с 
appropriate nonspeech continua, 
discussed earlier, Lane 


and discrim- 
ed in a com- 


As 
and Schneider 
h a comparison 
of the /do/—/to/ 
a series of com- 
he same param- 


It follows 
ead to different de- 
grees of disparity between predicted 
and obtained discrimination functions, 
This is illustrated in Figure 11 for the 


it will be Seen 


Tespond well to 
Tison with simi. 


/to/ continuum 
reported by Liberman, Harris, Kinne 3 


and Lane (1961) and for the /h/--/]. 
/g/ continuum shown in Figure 5 of 
this review confirms that Speech and 
nonspeech stimuli do not differ in this 
regard. 
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Since the degree of correspondence 
between discrimination and bee 
does not differ appreciably for че 
and nonspeech continua, the degree 
correspondence does not provide. | 
basis for the postulation of a specia 
perceptual mechanism for the discrim- 
ination of speech stimuli. 


Gradients of Identification Latency n 
Consonants, Vowels, and Nonspee 
Stimuli 


Because of the discrimination peaks 2 
the phoneme boundaries, the incoming [co А 
sonant] sounds аге heard categorically В 
and they are therefore quickly and see 
sorted into the appropriate phoneme 0! 


(Liberman, Cooper, Harris, & MacNeilage 
1963, p. 3]. 


The correspondence among identifi- 
cation latency, discrimination enhance 
ment, and identification probability, 
often adduced in characterizing wally 
sonant perception; describe oy! 
well the perception of vowels and T 
nonspeech stimuli, This further e 
formity in the behavioral effects of ‘a 
three classes of acoustic continua adds 
another dimension in which they are 
congruent. é 

Cross and Lane (1962) reported » 
Systematic relationship between the ге А 
ative frequency and the latency of anh 
tification responses evoked by stimu 
from a synthetic speech пили 
/0/-/to/: Latencies were minimal ja- 
stimulus values corresponding to p! 
teaus in identification (the “paneme 
Classes"), and they were maximal te 
Stimulus values corresponding aes 
boundary of the identification gradient. 
Studdert-Kennedy et al. (1963) ^ 
Teported comparable findings, both v 
Synthetic speech consonants, /b/, d r 
/£/, and vowels, /i/, /1/, /e/: "Fo 


d d 
both Stops and vowels, voice an 


a Liberman, Cooper, Harris, and e 
eilage (1963) ; Liberman, Harris, aN 
Lisker, and Bastian (1961); Eimas (1 
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button-press reaction times were lower 
near phoneme centers than in the vicin- 
ity of phoneme boundaries [p. 33]." 
Gradients of identification latency for 
consonant and vowel stimuli are com- 
parable not only to each other but also 
to latency functions for nonspeech stim- 
uli. Cross and Lane (1962) reported 
gradients of identification latency for 
Pure-tone intensities that also reached 
Minima within identification plateaus 
and maxima at the boundary of the 
identification functions. Lane and Kopp 
p later replicated these findings. 
ан (1963) obtained comparable 
1 ings for the relative frequency and 
fatency of responses during color nam- 
ing. A psychophysical study by Kellogg 
(1931) obtained a similar relation be- 
I identification frequency and la- 
i с Kellogg used seven fixed pairs 
a intensities that varied in the 
= = ratio between the members 
is ew His subjects responded 
а eft side darker” or “right side 
‘inet to each pair, and latencies were 
ne : d. When the probabilities. of 
tite Wo responses approached equality, 
ncies were maximal; when response 
р were most disparate, 1а- 
les were minimal. 
"m ү experiment on identification 
joa forms cited earlier, Cross et al. 
Ges ; also determined the laten- 
re E their subjects' responses which 
mae either a consonant contrast, 
ARRAS or a vowel contrast, 
Ab/-/bib/. These investigators 
Conclude: 


a gi omnes latency functions obtained in 
Sona nts y, with both vowel-contrast and con- 
maxi -contrast labeling responses, reach a 
fondi s at the "boundary of their corre- 
mun ing identification gradients and a mini- 
п within the plateaus of those functions. 
iat findings for response latency in 
cord ication of nonspeech visual forms ac- 
УҢ with those reported for identification of 
hetic speech stimuli . . . Гр. 72] 


ic 
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The same relationship between iden- 
tification frequency and latency for 
nonspeech stimuli was obtained by 
Cross and Lane (1962) under condi- 
tions that permitted measurement of 
changes in response topography. Two 
vocal responses whose articulations 
were continuous were employed : hum- 
ming with a fundamental frequency of 
147 cycles per second and humming 
with a fundamental frequency of 227 
cycles per second. These responses 
(+2 cycles per second) were first 
differentiated out of the subjects’ vocal 
repertoires by selective reinforcement, 
and then conditioned discriminatively 
to noise bursts at 50 and 80 decibels 
(SPL), respectively. Measurements of 
response probability, latency, and 
topography (fundamental frequency), 
made during subsequent generalization 
testing, are represented for three sub- 
jects in Figure 13. Inspection of the 
figure reveals, first of all, the character- 
istic relations between identification 
frequency and latency obtained in 
studies employing consonant, vowel, 
and nonspeech continua. Most im- 
portant, it will be seen that there 
was no tendency for the responses to 
change topographically with changes in 
the stimulus. This finding with non- 
speech stimuli accords with the out- 
come of an experiment, discussed 
earlier in this article, by Harris et al. 
(1961). These investigators made 
electromyographic recordings during 
mimicry of stimuli from the synthetic 
speech continuum /sht/-/spht/. They 
found that “either there was a normal 
burst of muscle potential at the lip 
(indicating a /p/ gesture) ог there was 
not [p. 842]." Since pairs of vocal 
responses that are topographically con- 
tinuous, as well as pairs that are not, 
fail to show blending at stimulus-iden- 
tification boundaries, it seems that the 
discreteness of an identification ге- 
sponse is more a matter of response 
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Stereotypy than it is of articulatory 
mechanics. 

à The general finding that the proper- 
les of identification, discrimination, 
latency, and response topography 
which are obtained with synthetic con- 
Sonant stimuli are also replicated with 
synthetic vowels and with nonspeech 
Stimuli is directly counter to the formu- 
lation of the motor theory of speech 
Perception and the predictions which 


follow from it presented earlier in this 
article, 


Psychophysical Functions for Speech 
oudness and their Relation to Vocal 


Effort 


Whereas the motor theory of speech 
ig gs propounded by Liberman 
his co-workers would sort the 
one of vowel quality and stress 
tion) one class (continuous articula- 
othe and that of consonants into an- 
Lo (categorical articulation), Lade- 
i n aes: excludes vowel quality 
ant ы e domain of the motor theory 
oe places stress in a common class 
Eig € consonants, despite the differ- 
ces in the articulatory responses that 
Produce them: 


most d quality, nasality, and. pitch are 

conste lY related to an ordering of the 

most со properties of the stimulus, whereas 

Stress A. ee qualities and features such as 

articul re ordered more simply in terms of 
ulatory activity [p. 406]. 


The evidence seen as favoring this 
ров has been cited earlier, and 
€ countervailing findings may now 

© described, 
et x was reported earlier that Lane 
estin (1961) found that the speaker's 
i ad of his own vocal level, his 
г оше output, grew as the 1.1 
MR of the sound pressure produced. 
(196. Ladefoged and McKinney 
of e found that listeners' estimates 
as кы loudness of speech sounds grew 
he 1.2 power of the sound pressure 


.provided by experiments that 
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presented. These authors concluded 
that their listeners were, in fact, esti- 
mating the effort required to produce 
the sounds that were presented to 
them.? However, Lane et al. (1961) 
also obtained listener estimates of 
speech sounds, varying in autophonic 
level and in sound pressure level, and 
came up with a different finding: Lis- 
tener estimates of the loudness of 
speech signals grew as the .7 power 
of the sound pressure—a function quite 
similar to the sone scale of pure-tone 
loudness. 

By using magnetic recording tech- 
niques to decouple autophonic levels 
from their corresponding sound pres- 
sure levels, Lane (1962) was able to 
vary both parameters independently 
and to assess the contribution of each 
to estimates of speech loudness. When 
autophonic level was varied from whis- 
pering, at the one extreme, to shouting, 
at the other, the quality (spectrum) of 
the voice inevitably varied, but listener 
estimates of speech loudness remained 
invariant. These findings, replicated 
several times, show that estimates of 
the loudness of speech signals are not 
under the control of the effort involved 
in producing those signals but rather 
under the control of the sound pressure 
levels of the signals. 

Further evidence that estimates of 
loudness are not estimates of effort is 
require 
subjects to match their autophonic level 
to sounds of varying loudness and by 
complementary experiments that re- 
quire subjects to decrease autophonic 
level, under increases in sidetone level, 
to maintain sidetone loudness constant. 
Ladefoged and McKinney 
(1963) contend that estimates of speech 
loudness are actually estimates of vocal ef- 
fort, Warren (1962) has recently argue 
that estimates of vocal effort are actually 
estimates of loudness. For evidence that 
the motor theory works no better when stood 
on its head, see Lane (1963). 


12 Whereas 
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Both kinds of equal-sensation functions, 
which turn out to be reciprocal Em 
functions, have an exponent (|.5|) 
predicted by the ratio of the u—À 
of the autophonic scale and the loud- 
ness scale (Lane et al., 1961). In 
other words, not one but two disparate 
psychophysical functions are required 
to predict the behavior of the subject 
who listens to the loudnesses of speech 
signals and matches or regulates them 
by dint of his own vocal effort. | 
The final experiment that was cited 
previously as providing support for 
the inference of mediating articulation 
in the perception of stress was con- 
ducted by Lehiste and Peterson 
(1959). These investigators found, it 
will be recalled, that when two vowels, 
generated with unequal effort, were 
presented to listeners at the same sound 
pressure level, the vowel generated 


with the greater effort was perceived as 
louder. 


The procedure employed in 
this expe 


riment does not permit us to 
assess the relative contributions to the 
perception of loudness from sound 
Pressure level and from effort cues 
when both are varying, as in natural 
language. Furtl 

atic difference w 
Vowels in each 


: Ог example, 
Pair with the higher 
cen judged louder 
13 Contrary to a Suggestion p 

i y Тайе; 
and McKinney (1963, P. 460) this Mine 
cannot be attributed to а failure by the 
subjects to perceive the stimuli 

them isolated vow. 


» Some of 
à els, as Speech. Lane et al. 
(1961) cite five experiments by other in- 
vestigators who used ; 


s nonosyllables ог con- 
nected discourse—McMahon 


wich and Taylor 
(1963) have made it six—all of which con- 
firm the basic disparity between the dynamics 
of speech and hearing, 
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simply because this was the only S 
criminable respect in which one мө in 
was in any way "greater" than 
other. : Е 
In order to explore the relation I 
tween effort cues and sound ace s 
determining judgments of vowel ev 
ness, Lane (1962) recorded the Qe 
neme /a/, produced at very pex dr 
very high levels, and played eac id 
these at various intensities to [йел 
The slope of the loudness pend 
relating estimates to sound | pU ee 
levels, was invariant under wide di sio 
ences in the levels at which the Pod 
were originally produced. зао had 
sequent experiment, Lane (196: » at 
speakers vocalize the phoneme /а 40- 
a wide variety of levels over pes 
decibel range and then played dis 
vowels at a constant sound [een 
level to listeners who were атан 
to give magnitude estimates of in ns, 
loudness. Under these conato 5 
with sound pressure level constan t€ 
effort or autophonic level pecie 
magnitude estimates did е ind 
Slightly as a function of the ae 
Sound pressure level; the "lou These 
scale had a slope of less than 0.1. nod of 
findings indicate that the by Lehiste 
Pair comparisons employed by islead- 
and Peterson (1959) gave Ah loud- 
ing image of the degree to whic 
ness judgments can be xx БЕГ 
Secondary properties of Le a sma 
Tatio-scaling techniques revea 


1 
o voc? 
effect which may not be related t 
effort. 


IONS 
Summary AND CONCLUS 


> iated 
Vocal responses may be differen crimi- 
topography and then brought | ied com- 
Native control either by the ме е labora- 
munity or by an experimenter 1n uli, draw? 
tory. If two discriminative ш two dif- 
from a Physical continuum, contro observer 
ferentiated vocal responses by an ontinuu™ 
and stimuli intermediate on the aracteristi© 
are presented to him, certain “lowing Ў 
Telations obtain among the fo 


in 
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pendent variables: stimulus identification, 
stimulus discrimination, identification latency, 
and response topography. 

Identification of stimuli from a synthetic 
speech continuum, which ranges between two 
phonemes, constitutes a special case of this 
general paradigm in which (a) discrimina- 
tion training and response differentiation 
have been conducted by the language com- 
munity over a period of years, and (b) the 
identification response is formally similar to 
the acoustic stimulus. Under these condi- 
tions, it is tempting to speculate that the 
vocal responses, which in the language com- 
munity generate the extreme stimuli on the 
Continuum, are active in the perception of 
those stimuli when they are generated by 
a device or another person. This hypothesis 
has found some support in inferences drawn 
from the relations among the dependent 
variables enumerated above. 

On the other hand, the inference that 
Stimuli are perceived in terms of their articu- 
atory correlates is unlikely to be formed in 
Comparable experiments in which (a) dis- 
tamination training and response differen- 
on are accomplished quickly in the 
aboratory and (b) the identification re- 
Sponse is not formally similar to the discrim- 
ative stimulus. Nevertheless, the same 
[cations among identification, discrimination, 
nae and topography are found in both 
and 5 of experiments ; those employing speech 

n _ those employing nonspeech continua. 
te ше basically the same conditioning and 
inden procedures are employed in both 
stud of experiments—the speech perception 

les are a special case of the general 
Paradigm—and since the same relations 
ene the four dependent variables are found 
ater we conclude that these relations are 
es tbutable to the general paradigm for dis- 
iminative training and testing and that 
me, ostulation of а special perceptual 
Š з ыы, for the discrimination of speech 
uli is unwarranted. 
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Hull assumed that the theoretical variables D and K combine. in р 
multiplicative manner while Spence has suggested that the a А 
additive. The great majority of studies dealing with this issue hav 


supported Spence's view in that the: 


action. 
experiments of Spence's i 


concerned with the combinati 
the findings of recent experi 


y have obtained no D X K inter- 
Experiments which haye involved “0” 


and reward, however, have reported such an int 
paper is concerned with describing the implicat 


values of deprivation 
eraction. The present 


In addition, it derives 


ments concerned with the possibility that 
acquisition-drive level may | 


on performance under subseq 


The manner in whi 
variables incentive motivation (K) 
and drive (D) combine to produce 
reaction potential (E) has been the 
subject of considerable experimental 
interest in recent years. One obvious 
Source of this interest lies in the fact 
that both Hull (1952) and Spence 
(1956) have advanced explicit, but dif- 
ferent, formulations of the rel 
Thus, according to Hull, K and D com- 
bined multiplicatively, while Spence has 
Suggested that the relationship is addi- 
tive. A second factor that has facilitated 
such research is the apparent ease with 
which the experimental problem can 
be defined, Thus, when suitable opera- 
tions for manipulating D and K are 
selected, the factorial design appears 
to provide an ideal experimental test 
of these alternative assumptions. For 
such investigations, typically involving 
two or more levels of both D and K, 
Hulls multiplicative formulation leads 
to the expectation of a significant D 


ch the theoretical 


ationship. 


1The writer is indebted to Kenneth W. 
Spence who read the present manuscript and 
provided helpful comments and suggestions, 


nave an historical or perseverating effect 
uent conditions of drive, 


X K interaction while Spence's p 
tive assumption predicts no interacti їй 
Stated graphically, if performance a 
plotted as a function of D idm 
the parameter, Spence’s ай vm 
formulation predicts parallel vies 
while Hull’s multiplicative assump 

requires that the functions diverge: 


Theoretical Relationship between 
D та К 


As noted later, the bulk of c 
from "typical" factorial studies be 
Supported Spence's position. ae 
theless, several experiments do s sm 
an interaction between time of -— 
tion (Td) and magnitude of T 
and have been interpreted A3 in 
Strating a nonadditive relationship ted. 
tween D and K, Tt must be no has 
however, that although i epe the 
assumed that the combination O ddi- 
theoretical variables D and K is a ach 
tive, his treatment of K is Hot P gn 
as to preclude the possibility o ita 
interaction between the exp a 
variables employed to manipulate | sent 
theoretical variables, The pre 
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paper is concerned with describing the 
conditions under which an interaction 
of these empirical variables is not only 
compatible with Spence's interpreta- 
tion of these variables but is, indeed 
implied by it. ' 
ү шше to Spence (1956), K is 

e molar theoretical variable that 
Summarizes the motivational conse- 
fenes of the r, — sy mechanism which 
S assumed to consist of implicit com- 
ги of the overt consummatory ге- 
te zh (Re) that become conditioned 
eem stimulus situation in which the 
Eos response 1S appetitively 
as rced.? The strength of the rg — Sg 
а anism depends, in turn, upon at 
ED variables. First, since it 
Sod n from classical conditioning to 
des е cues, the strength of fg— Sg 
ie € an increasing function of the 
му m and duration with which Re 
оке een elicited in the presence of 
PA = Thus, K is assumed to be 
of сш function of the number 
Ud d trials and approaches an 
Bu ptote determined, in part, by the 

gnitude of reward. Second, since 

€ strength of r,— s, is assumed to 


2 
on a 7, — s, mechanism is an abstract 
hot re retical concept which, as such, does 
ion Vue explicit. speculation or specifica- 
о ко its physiological nature or locus 
has Ep It is true, however, that т; 
AER рач опаПу been considered to consist 
5 оао ега1 responses such as salivation, 
or exa contractions, etc. Miller (1963), 
Кесе has recently presented an in- 
Sembles ion of reinforcement which re- 
m in part, Spence's treatment of the 
» mechanism. Miller notes, however, 
E analysis specifically "does nof limit 
dies; incentives based on the conditioning 
Ip. pheral consummatory responses . - + 
C NR Actually, the nature of Spence's 
to Ne of r,— s, does not compel him 
Sbonses that mechanism to peripheral re- 
Dlicit a nor has he so limited himself by ex- 
dris ee The present paper also 
Central | that туз may be both peripheral or 
veloped m nature and that the argument de- 
oh: ere is equally applicable to either 

implicit response. 


Y 
that 
itself 


reflect the vigor of the overt consum- 
matory response (Re) of which it is 
a component, any variable which con- 
tributes to the vigor of Re will also 
affect the theoretical value of K. 

A possibility implicit in this formula- 
tion which has not been previously 
elaborated is that the procedures em- 
ployed to produce D (eg. Td) are 
also determinants of K in instrumental 
reward conditioning. There are sev- 
eral bases on which such a relationship 
might be expected. Thus, since the 
consummatory response (Re) is an 
overt, skeletal response, its strength or 
vigor should vary directly with D in 
the manner typical of other such re- 
sponses. Itis well known, for example, 
that the amount of food or water which 
a subject will consume varies directly 
with length of deprivation, and Stellar 
and Hill (1952) have concluded that 
this type of consummatory behavior— 
at least in the case of thirst—provides 
the most accurate, available index of 
drive level. Similarly, Snyder (1962) 
has reported that the rate at which rats 
licked for a saccharine reward is an 
increasing function of D, although 
Stellar and Hill (1952) and Collier 
(1962) have found licking rate to be 
relatively independent of D. Even in 
cases in which the vigor of Rg is not 
strongly affected by Td, however, Td 
may still act as a determinant of K, 
both through its facilitation of r, di- 
rectly and through its effect on the 
height of the gradient of generalization 
of r, from the goal box to the runway, 
etc. 
Level of deprivation may also con- 
tribute to K by modifying the “palat- 


ability" of incentives or the threshold 
f consummatory 


for the occurrence O 
behavior. Thus, there may be no food 
incentive sufficiently “palatable” to 


evoke a consummatory response in a 
“completely” satiated subject while 
as satiation decreases, increasingly less 


312 


palatable incentives may exceed the 
threshold for Rg. Evidence of such an 
influence of Td on the palatability or 
incentive value of rewards is reported 
by Stabler (1962) who found that run- 
way performance was a decreasing 
function of sucrose concentration under 
low Td, but an increasing function of 
concentration under high Td. One 
possible interpretation of this finding 
is, of course, that the incentive value 
(or, K) associated with different 
Sweetnesses depended on Td. Further 
corroboration of this finding that the 
palatability or incentive value of a re- 
ward varies with Td was reported by 
Snyder (1962) who found that the re- 
inforcement value of a presumably non- 
nutritive reward (saccharine) was, like 
that of sucrose, a function of Td. Un- 
like the result with Sucrose, however, 
performance under high Td was a 
decreasing function of Saccharine con- 
centration, while with low Td perform- 
ance increased with concentration. 


Empirical Relationship between Depri- 
vation and Reward Magnitude 


ant empirical findings, 
es of studies 
es have been 
he view that 
St when ex- 
variables are 
td, Shea, and 
ally combined 
th two levels of 


employed. Thus, Sewa; 
Elkind (1958) factori 
two levels of reward wi 
deprivation in a Tunway-conditioning 
situation. Magnitude of reward was 
either 0 or 1.00 gram(s) of wet mash 
while D was varied by testing the sub- 
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jects either immediately before or after 
administration of their total daily i 
tion, thus providing one group ipa 
was 23-hours hungry and one which 
was 0-hours hungry and, presumably, 
satiated. Letting lowercase letters ted 
dicate low, and capital letters, ne 
values of the theoretical variables Ё 
and К, the groups were: DK, Dk, dK, 
and dk. For the latter three Veg 
little evidence of the acquisition of t и 
instrumental running response ps 
found and only slight differences a 
tween the groups occurred, pe ы 
performance was somewhat better d 
Dk. Group DK, on the other han : 
produced a definite acquisition ar 
and ran considerably faster than t E 
other groups. The fact that Groves 
dk, dK, and Dk were about equal a 
inferior to DK gave rise to a significa a 
D X K interaction, since the gen 
between, for example, DK and Dk ee 
thus necessarily much larger than tha 
between dK and dk. РР 
While Seward et al. (1958) ше 
preted these results as suggesting "las 
Spence’s additive assumption арр" А 
only for intermediate levels of a a 
tion and reward, their data are actua ка 
quite compatible with the assumption 5 
that D and K combine additively while 
is a function of both reward 4 
tude and deprivation level. Thus, if | 
O-hours deprivation groups were nd 
or completely satiated, it is aire 
that the subjects in those groups p 
Sumed much or any food during Ше 
-9% Seconds they remained in. es 
baited goal box, In this event, e 
@ Was not evoked, K would deve "a 
no more than for the subjects in E 
"zero reward" groups and perium 
Group dK should show performan 
essentially identical to that of amni 
dk while neither should improve we i 
training, no “reinforcement” being а i- 
ministered, Group Dk should xe 
larly fail to improve with training, 
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шк expected to exhibit a higher 
one, performance than dK or dk 
ite d its higher level of D. 
with зер DK should improve 
ede ШЕ аз а result of its acquisi- 
aA ae and should be significantly 
an e o any of the other groups, 
hese ESE groups acquire no K. 

al roseus were clearly con- 

sd by the Seward et al. study, and 
є. es interpretation may be placed 
Raa ed studies by Seward and 
or (1960) and Seward, Shea, 


and D 

below. avenport (1960) mentioned 
үү . 

Nes the assumption that K is 

басени: determined by D сап ар- 

toga r Pre for the interaction 

zero v those studies which involve 


De "ign of deprivation and reward, 
Which E numerous experiments 
Variables РОЛ no zero values of these 
typically" no DxK interaction is 
ee E. neqne Thus, with D de- 
K by : egree of food deprivation and 
ieee mue of food reward, no 
1960) R was found by Logan 
en то зоа and Pavlik (1960), 
manipulat 1%). When D was 
es by differential reduction 
nificant | Ject's body weight, no sig- 
Sucrose interaction was found with 
tich, Ф concentration (Brush, Good- 
пог Re alts р, & Eisman, 1961) 
(Vireo magnitude of food reward 
Snyder wa Freygold, & Blum, 1962). 
ime for ( 962) varied deprivation 
Tation 7 and water and concen- 
to obt „of saccharine and also failed 
На, E the interaction predicted by 
and Bada the other hand, Ehrenfreud 
action p ia (1962) did obtain an inter- 
tion in etween D, as defined by reduc- 
foog rst body weight and amount of 
PPosite 21% but the interaction was 
гус © t0 that predicted by Hull—i.e., 
Or groups trained under small 

large rewards actually con- 
with increasing D—a result 


Versus 
Vergeq 


313 


these experimenters interpret as a case 
of "exponential addition" (Spence, 
1953). Finally, while the above ex- 
periments all involved runway condi- 
tioning, similar results were obtained 
by Hulicka (1960a, 1960b) for a bar- 
pressing response. 

It is apparent, then, that in the 
typical experiment employing no zero 
levels of reward or deprivation, а 
Deprivation X Reward interaction is 
not obtained. The present hypothesis, 
however, assumes that K varies with 
deprivation level as well as reward 
magnitude—an assumption which may 
appear to imply that a Deprivation X 
Reward interaction should be expected 
generally. In the type of experimental 
situation in which no Deprivation X 
Reward interaction has been found, 
however, none is, in fact, required by 
the present analysis. For expository 
purposes, for example, consider a 2 
x 2 factorial design involving “low” 
and “high” values of both reward 
magnitude and deprivation level. As- 
suming that K is a function of both 
deprivation and reward, that K and D 
combine additively, and that а nu- 
merical weight of 1, 2, or 3 may be 
assigned to the low, medium, and high 
values of each of these variables, the 
theoretical expectations for such an 
experiment are represented in Table 1. 
Note that in each cell D has a value of 
either 1 or 3, since this variable is 
independent of magnitude, 
while K assumes values of 1-3, reflect- 
ing both magnitude of reward an 
deprivation level. The total in each 
cell represents the additive combination 
of D and K, and the marginal figures 
indicate differences between cells adja- 
cent in rows ОГ columns. Inspection 
of these marginal differences indicates 
that no interaction of level of 


and reward magnitude is 
Moreover, t 


his analysis 
e generalized to 


reward 


clearly 
deprivation 
predicted. 


can readily b more 
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TABLE 1 


IMPLICATIONS OF THE PRESENT ANALYSIS 
REGARDING THE THEORETICAL Сом- 
BINATION OF DEPRIVATION 
AND REWARD 


Magnitude of reward 
Level of 
deprivation 
Low High Diff. 
K=1 K=2 
Low + + 1 
1 1 
pe чу 
К=2 К=з 
High + + 1 
3 3 
D= => 
5 9 б 
Diff. 3 3 


hand, does not far 
evidence reviewed 


onnection with the 


f and his associates 
are not directly applicable 


pretation Hull woul 
upon their data, 

formulation does а 
deriving the results 


Ac- 
е ($н-р), 
ntive (K) 


ultiplicative 
fashion to determine reaction potentia] 


(S®-R). Thus a zero value of any one 
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of these variables would reduce S®-R to 
zero and result in performance at aU 
"operant" level. Since Seward and his 
associates! experiment involved three 
groups (dk, dK, and Dk) which had 
at least one of these variables approach- 
ing zero value, Hull's theory would 
predict that the performance of bie 
Sroups would be about equal, her 
not improve with training, and heo: 
be inferior to Group DK for which 
neither D nor K was zero. These pre- 
dictions are, of course, similar to those 
deriving from the present assumption 
and were confirmed by the results О 
Seward et al. Sif 
The experimental result which is Г 
primary embarrassment to Hul x 
formulation is the fact that an qd 
action between reward magnitude an 
Td is typically observed only when а 
zero value of one of the variables 15 
employed. For example, Seward, p wee 
and Davenport (1960) employed thre 
levels of both drive and reward i 
factorially designed, runway-con is 
tioning experiment. Finding a Бер 
vation X Reward interaction of up 
mal statistical reliability for the ges 
3 X 3 table, these experimenters t 
amined the nine simple 2 x 2 intera 
tions which their design provided. hat 
the three simple interactions t th 
proved significant, two involved pom 
zero reward and the prefeeding of ue 
subject its entire daily diet (1 hour ind 
cess to food) just prior to trai nt 
(ie., 0 Td), while the third significa : 
interaction also involved the pad es 
deprivation group. Although the ! 
teraction observed when Td or px 
Was zero is compatible with both Hu to 
and the present analysis, the failure " 
Obtain an interaction in the six EE 
Parisons not involving zero values vs 
forms only to the implications of | li- 
latter view and not to Hull's multip Е 
Cative assumption, Furthermore, ps 
implications of Hull's formulation а 
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08 по support from the тоге соп- 
жы ional factorial studies since these 
uos MD considerable con- 
месни not observed the inter- 
E о Td and reward magnitude 

quired by his position. Thus, Table 
Seon the predictions deriving 
Ste ull’s assumptions that D and K 
ox multiplicatively, that K is in- 
ERES К of Td, and, for present pur- 
Jem E 5 а value of 1 or 3 can be 
E s to the high and low values of 
d er these variables. . Examination 
ee figures indicating dif- 
reels adjacent cells clearly dem- 
нме hat a Deprivation x Reward 
eor uà is expected. . Moreover, 
E hen Hull’s multiplicative as- 
that xs is coupled with the view 
E ы po function of Td, a Depriva- 
ie n eward interaction must still 
Holed ар prediction which, as 
Confirmation” has repeatedly failed 


The E 
ub Effect of Acquisition Drive on 
sequent Performance 


ены problem of considerable 
Bent i interest and to which the 
hat of hypothesis readily extends is 
s whether the level of D present 
strume an initial training session in in- 
Siig reward conditioning affects 
Period ance in a subsequent testing 
con е altered, but equated, D 
«а ют ns—ie., whether D has an 
агу p effect" on performance. 
With | кык studies concerned 
such is question tended to find no 
or jP*severating or historical effect 
1945) (es. Finan, 1940; Kendler, 
тро, but more recent runway studies 
the Е careful procedures to insure 
testin Pie equating of D prior to 
an e Е have rather consistently found 
t (eg. Barry, 1958; Brush, 
196 rich, Teghtsoonian, & Eisman, 
> Campbell & Kraeling, 1954; 


iL 
“wis & Cotton, 1957). If D present 


TABLE 2 


IMPLICATIONS OF HULL'S ASSUMPTIONS 
REGARDING THE THEORETICAL Сом- 
BINATION OF DEPRIVATION 
AND REWARD 


Magnitude of reward 


Level of 
deprivation 
Low High Diff. 
К=1 K=3 
Low x x 2 
p= Det 
1 3 
K=1 K=3 
High x x 6 
Des fes. 
3 9 
Diff. 2 6 


during the acquisition of an instru- 
mental response does, in fact, affect the 
performance of that response under 
subsequent, equated conditions of D, 
one interpretation might be that the 
associative factor (H) is a function of 
D. Nevertheless, since H, once ac- 
quired, is usually considered to be rela- 
tively stable or permanent, this as- 
sumption leads to the expectation of 
a persistent influence of acquisition D 
on subsequent performance. The usual 
finding, however, has been that the 
perseverating effect of acquisition D, 
when obtained, is quite transitory. 
Thus, if a group trained under high D 
and tested under low D is compared 
with a group trained and tested under 
low D, the superiority in performance 
of the former group typically disap- 
pears after the first few trials, thus 
suggesting that some historical factor 
other than H has been affected by 
acquisition D. 

The hypothesis proposed in the pres- 
ent paper—that deprivation level is a 
determinant of K—also leads to the 
expectation of an historical effect of D 
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in instrumental reward conditioning. 
Thus, since it is presently assumed 
that K is an historical variable which 
is a function of the level of Td under 
which it is acquired, those subjects 
trained under high D will enter the 
testing phase of such an experiment 
with a higher level of K than the sub- 
jects trained under low D. Due to the 
reduction in D, however, the strength 
of Re for the initially high D group 
will be lessened at the beginning of the 
test phase. Further, since the strength 
or vigor of r, is assumed to reflect 
that of Re, r; — s; and, hence, К will 
also undergo rapid reduction following 
the shift to lowered D. The historical 
effect of acquisition D, therefore, while 
required by the present hypothesis, 
would also be expected to be quite 
transitory. Thus, the hypothesis that 
K is a function of Td not only consti- 
tutes an alternative to the view that Н 
is a function of D in instrumental 
reward conditioning, but appears, in 
addition, to provide a more adequate 


description of the relevant empirical 
evidence. 
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THEORETICAL NOTES 


COMMENTS ON MANDLER’S “FROM ASSOCIATION 
TO STRUCTURE” 


JOHN JUNG: 
California State College, Long Beach 


A critical analysis of some of the 
by Mandler (1962) as supportive 


which develop from gradual assoc; 


quacies in his formulation, 


evidence from transfer studies cited 
of a concept of cognitive structures 
iationistic processes revealed inade- 


Furthermore, recent studies on the effects 
ing (with control 


s for nonspecific transfer) 
-C paradigm also are diffi- 
Nor is it clear how older 


overlearning of interfering lists in retro- 
active and proactive inhibiti 


concepts can account for th 
with the concept of structure. 
useful in other areas but appa: 


the same reason, 


Verbal materials in the А-В 
affer A-C) i i 


Tesponses from the 
learning of the se 
transfer list if the first 1 
learned. 

The present commen 
Mandler's interpretation of the facts of 
overlearning or degree of first-list learn- 
ing in transfer tasks, Mandler's survey 
of relevant animal and human studies in- 
dicated that negative transfer first in- 
creases, then decreases, and in some cases 


first list 
h cond or 
ist was over. 


ts are directed at 


1 Now at York University, Toronto, 


on paradigms can be explained by Mandler. 


gs which are incompatible 


- It was suggested that structure may be 
rently not i 


n transfer of rote learning. 


becomes positive transfer as a function К 
Overlearning of the first list. It ui 
concluded on this evidence of a U fun e 
tion between overlearning and gun i 
transfer that structures are formed gra 
ally out of associations. sty of 
Several objections to the necessity e 
invoking the concept of structures to Ра 
count for such evidence will be та in 
In the first place, the soundness pu^ 
Some cases, the appropriateness of his 
findings on which Mandler based 10 
formulation can be questioned. In all n- 
the animal studies cited, as Mandler ud 
Self noted (pp, 422, 425, 426), no con rn- 
group for assessing warm-up and е о 
ing-to-learn was included. How muc s- 
the subsequent decrease in negative ps 
er as Overlearning increased in pe 
Studies is attributable to specific and fer 
much to nonspecific sources of trans 
is hard to say. ing 
Furthermore, discrimination-learnine 
Situations such as those used in the cM 
Studies cited are quite dissimilar from t d 
-~ transfer task, but Mandler не 
them together since they both apparen e- 
involve the learning of two distinct 5. 
SPonses to the same stimulus. Closer dis- 
amination reveals that, whereas in à 
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crimination-reversal task two stimulus 
cues (e.g., black and white) and two re- 
sponses (eg. right and left turn) are 
available on every trial, in the A-C para- 
digm the subject receives only one re- 
sponse paired with each stimulus during 
the first list, and on the second list he 
receives a new second set of responses, 
one paired with each of the same stimuli. 
The animal-learning situations cited by 
Mandler are more closely related to the 
A-B, A-Br (hereafter A-Br) paradigm 
in which the same stimuli and responses 
appear on both tasks but in different pair- 
ings, 

With respect to the three human studies 
using the A-C paradigm cited by Mandler, 
Only one (Mandler & Heinemann, 1956) 
Contained an A-B, C-D control condition 
to assess warm-up and learning-to-learn. 
In addition to this absence, the Siipola and 
Israel (1933) study, although showing 
Positive transfer with overlearning, in- 
Volved an A-Br rather than an A-C para- 

igm. Positive transfer has been found 
when low meaningfulness (M) or difficult 
Ss pohges such as those used by Siipola 
re Israel are employed in the A-Br para- 
Mo (Mandler & Heinemann, 1956; 
mae & Battig, 1963). However, sub- 
du negative transfer can be obtained 
sith this paradigm when high-M re- 
Ponses are used (Besch & Reynolds, 
1963, Jung, 1962; Merikle & Battig, 
19625, Porter & Duncan, 1953; Postman, 

ie : Twedt & Underwood, 1959). 

s of response learning rather than 
or ыыы of Structures can account 
paradi positive transfer obtained with this 

acigm with low-M responses. 
ie though evidence from transfer stud- 

is ited by Mandler is weak support for 
not concept, recent findings which were 
E Available at the time of Mandler's 
og but which include the necessary 
Ses for nonspecific transfer do sug- 
us p transfer under the A-C paradigm 
lee be a U function of degree of first-list 
POE to a small extent (Jung, 1962; 
ai 1962b). Although neither 
пе Y reported significant differences in 
| ЁаЧуе transfer as a function of first- 
sha Overlearning, Postman found a U- 

Ped trend, and Jung, using only two 
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degrees of first-list overlearning, found 
trends toward less negative transfer with 
overlearning during the last half of the 
transfer task. No positive transfer was 
obtained as in studies lacking the proper 
control condition; this suggests that com- 
petition between the associations of the 
two lists is still operative with overlearn- 
ing. 

Secondly, there are certain established 
findings regarding the effects of over- 
learning of the interfering task in retro- 
active and proactive inhibition (RI and 
PI, respectively) studies which appear 
to be contrary to a notion of cognitive 
structure. With PI, a number of studies 
have found that increased first-list learn- 
ing leads to enhanced PI (e.g., Atwood, 
1953; Postman & Riley, 1959). The con- 
cept of structure should predict less PI, 
as the first list is overlearned. In the case 
of RI, findings (e.g., Briggs, 1957; Thune 
& Underwood, 1943; Underwood, 1945) 
indicate that the greater the degree of 
learning on the interpolated list, the more 
RI, although in some cases it decreases 
with high overlearning but not to the ex- 
tent of zero RI which a concept of struc- 
ture would predict ultimately. Thus, in 
both PI and RI paradigms, overlearning 
of the interfering list relative to the de- 
gree of learning of the list-to-be-recalled 
results in greater interference rather than 
a reduction of negative effects as implied 
in the concept of structure. 

Third, recent experiments (Tung, 1963: 
Merikle & Battig, 1963; Spence, 1963) 
demonstrating the effects of response 
on transfer in the A-C paradigm also pro- 
vide difficulty for Mandler. If the writer 
understands the concept of structure cor- 
rectly, structure is a direct function of M. 
In the A-C paradigm, therefore, less nega- 
tive transfer should result with responses 
of high M (or structure). Small but 
consistently greater negative transfer has 
been found with high than with low M 
when trigrams were used (Jung, 1963) 
or when words and trigrams were used 
(Merikle & Battig, 1963). A different 
method of assessing the effects of first- 
list structure on A-C transfer was em- 
ployed in one of Spence's (1963) experi- 
ments. She employed a first list which 
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contained pairs of words which were 
strong associates of each other ; on the 
transfer list she used the same stimuli 
from the first list but paired them with 
a different set of words which were weak 
associates of the stimuli, Spence main- 
tained that the first-list associates which 
she used undoubtedly had more structure 
than any practical amount of laboratory 
overlearning could Provide since the 
strong preexisting associations used in 
that list had been formed over a subject’s 
lifetime. 

According to the concept of structure, 
the first list of strong associates should 
not interfere with the learning of the 


However, Spence reported that negative 
transfer was obtained even under these 
conditions, 


Thus several r 
fects of response 
associative first-li 


A-C paradigm Provide evidence с] 
Opposed to the Concept of structure, 


From Structure Back to Association 


As Mandler noted, 
then outline th j 
ently involved 


concept of structure 
sociative concepts, 


The general findings of increases, then 
decreases, in negati 


by means of as- 


tion, response availability, 
differentiation. In this paradi 
dissimilar responses are pa 
same stimuli on the two | 
source of specific transfer is that of as- 
sociative learning (Jung, 
i.e, associations formed on the first list 
interfere with the formation of new as. 
sociations to the same 
second list, hence, negative transfer, 
Barnes and Underwood (1959) and 
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Postman (1962a, 1962b) demonstrated 
that after the learning of the second it 
in the A-C paradigm, the first-list e 
sponses were less available than the odi- 
ond-list responses as measured by a mod S 
fied free-recall task. Furthermore, Barne 
and Underwood found that this a a 
in availability increased with higher di- 
grees of second-list learning. This а 
minished response availability sugges 8 
to Barnes and Underwood that the el 
peting first-list associations underwent e 
tinction during second-list learning. of 
By analogy, increasing the degree ee 
first-list learning for a constant. йге 
of second-list learning should eed 
availability of competing first-list dee 
ations during second-list learning, Eon 
Setting to some extent the QUEUE o 
Which such associations also €— 
Indirect support for this hypothesis ег 
be found in the fact mentioned gat p 
that increased PI (poorer retention à 
the second list) results as a function 
overlearning of the first list. m 
The increase in negative Bender, а 
Served with moderate overlearning 062b) 
first list (Jung, 1962; Postman, 19 dd 
may be due to the fact that the interferi re 
first-list associations are simply mo s. 
available during second-list y 1 
But, as has already been pointed out, ac 
Overlearning of the first list leads to a st- 
crease in A.C negative transfer. Е 
man (1962b) accounted for this fai us 
of high Overlearning to lead to rie 
greater negative transfer by means 0 ed. 
ifferentiation which had develope. 
en two lists are practiced to an no 
degree, the differentiation between nt 
lists is minimal; however, as the pop 
of practice on one list relative to that be 
the other increases, differentiation may h 
Said to occur. Such is the case with а 
Overlearning of the first list in the ёв 
Paradigm; such differentiation a 
the competition between responses cm 
the two lists, thus producing less negat! 
transfer, ton is 
One difüculty with this el dp UM 
that while differentiation may lead to de- 
creases in overt intrusions (Thane it 
Underwood, 1943; Underwood, nig n 
oes not necessarily lead to reduce 
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terference in the RI task. While Under- 
wood may be correct in maintaining that 
differentiation does not reduce RI (or 
PI), this is not to rule out the possibility 
that differentiation may be a potent aid in 
reducing response competition and nega- 
tive transfer during the original learning 
of the second list. According to Under- 
wood, differentiation dissipates over time; 
Since the retention of the first list is 
measured a long time after the learning 
of the second list in the RI paradigm, the 
ameliorative effects of differentiation on 
retention may be small or absent due to 
its dissipation during the long interval 
etween interpolated learning and a re- 
tention test of the first list. 
к Turning to the effects of overlearning 
їп RI and PI studies which appeared in- 
compatible with the concept of structure, 
response availability seems to be a useful 
Concept. It will be recalled that more 
‚15 generally found, the higher the first- 
ist learning, and that more RI is gener- 
ily found, the higher the second-list 
„arning. Both cases could be due to the 
Increased availability of associations 
which are incompatible with those re- 
qued in the less well-learned list-to-be- 
is died. As pointed out earlier, due to 
awe nature, any differentiation 
hess also develops. with overlearning 
tion not reduce negative effects in reten- 
5 n (RI and PI) although it may assist 
"ginal learning of the second list. 
nee mally, how can the findings of greater 
„иче transfer with high-M responses 
accounted for by associative concepts ? 
E tl can be assumed that response avail- 
ty is a direct function of response M, 
fer the finding of higher negative trans- 
is With high M is not surprising. That 
4 associations formed on the first list 
n either be forgotten or extinguished 
моц, if they are of low М; thus they 
the d be less available to compete with 
ond New associations required on the sec- 
те list. No direct evidence on how 
is Ponse M affects response availability 
H available to the writer’s knowledge. 
no, Yer, for a wide range of M from 
Ha Syllables to words, the assump- 
ue what availability is a direct function 
T might be based on the letter-se- 
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quence gradient of extraexperimental in- 
terference (Underwood & Postman, 1960). 
According to this notion, letter sequences 
contained in experimental materials are 
subject to such extraexperimental inter- 
ference (and forgetting) to the extent 
that they differ from those sequences oc- 
curing in the natural language. Thus, 
low-M materials would suffer the most 
interference from this extraexperimental 
source, and their availability would be 
diminished accordingly. 

Consequently, negative transfer may be 
less with low-M responses simply be- 
cause low-M associations are less avail- 
able during second-list learning. On the 
other hand, a concept such as Mandler's 
should predict just the opposite, more 
negative transfer with low-M responses. 

Greater negative transfer with high-M 
responses is not limited to the A-C para- 
digm. In fact, although the A-B, C-B 
(hereafter C-B) paradigm is generally 
regarded as a positive or, at least, zero 
transfer paradigm, negative transfer has 
been obtained with high-M responses 
(Tung, 1963; Twedt & Underwood, 1959), 
In the Jung study in which response M 
was varied, high-M responses led to neg- 
ative while low-M responses gave posi- 
tive transfer. Two sources of transfer 
in this paradigm can be examined here, 
response learning and backward associ- 
ations. The lower the response M, the 
greater the response learning necessary 
in this paradigm. Whereas positive trans- 
fer is obtained when response learning is 
needed, negative transfer is found with 
high-M responses which require little or 
no response learning. 

The backward associations of the C-B 
paradigm, according to Twedt and Un- 
derwood (1959), constitute an A-C para- 
digm in which the first-list backward 
associations (B-A) compete with those 
formed on the second list (B-C). Just 
as it was hypothesized earlier that re- 
sponse M influences first-list response 
availability of forward associations in 
the A-C paradigm, it may be argued that 
response M also directly increases the 
availability of first-list backward associ- 
ations in the C-B paradigm. Thus the 
greater availability of interfering back- 
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iati lus the absence of re- 
ко Teaming Ie to negative transfer 
hoo high M whereas low-M responses 
lead to positive transfer in the C-B para- 
digm due to the presence of transferable 
response learning and lower availability 
of competing backward associations from 
the first list. 

In summary, it has been argued that a 
consideration of the “fate” of the first-list 
associations during the learning of the 
second list of the A-C paradigm provides 
an associationistic explanation of the ef- 
fects of first-list overlearning and re- 
sponse M on transfer in this paradigm. 
Overlearning, though eventually checked 
by differentiation, leads to greater nega- 
tive transfer by increasing availability of 
competing first-list associations during 
second-list learning. These two factors 
counteract each other, thus differences in 
negative transfer in this paradigm as a 
function of degree of first-list learning 
are small, 

Higher response M 
negative transfer also b 
ability of the first-list 
subsequently interfere 
of the second list, 


О need is seen for invoking cognitive 
Structures to account for transfer effects 
in verbal learning. This is not to deny 
the possible usefulness of such concepts 
in explaining learning sets, discrimina- 


tion-reversal learning, concept identifica- 
tion, and the like, 
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SUBJECTS DO THINK: 


A REPLY TO JUNG'S COMMENTS? 


GEORGE MANDLER 


University of Toronto 


Jung's critique is found wanting on 2 points: (a) the definition of 
overlearning and the effects that a structural position actually predicts, 
(b) the suggestion that differentiation is an associative concept. The 
data Jung discusses are found to be consistent with Mandler's original 
position, as is the notion of "differentiation," and it is suggested that 
Ss do think, even in paired-associate experiments. 


E (1965) comments raise a num- 
sud. issues concerning the effects of 
А саш on the A-B, A-C paradigm 
ne verbal learning. There are a number 
m points that cannot and need 
se e dealt with at length, such as the 
i on that the subjects in a paired-associ- 
м oe learn single responses to 
TM stimuli, and the fact that Under- 
that t; Postman's (1960) data indicate 
RN je time interval typically used 
rn -B, A-C paradigm the letter-se- 
net e effect does not predict lower 

3 ability for low-M responses. 
dh Will restrict my present comments to 
bt д: What does an overlearning- 
das ure position (Mandler, 1962) pre- 
et ү “ What is the status of the concept 
erentiation ? 


Over. i 
verlearning and Transfer 


gli sites various studies—and in par- 
а a in relation to the A-Br paradigm 
ROA Б retroactive- and proactive-inhibi- 

it g I and PI, respectively) studies— 
sete s not involve overlearning in my 
that e the term. It should be quite clear 
Um че overlearning to be a func- 
rates the number of trials following the 
айе nment of completely correct perform- 
fined Thus, overlearning cannot be de- 
crit as a percentage function of trials to 

m as Postman (1962), for example, 
S one, Following this reasoning, one 

Оша obtain 200% overlearning after 


1 
Б Preparation of this paper was supported 
T ao Science Foundation Grant 
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six trials of acquisition if a list only takes 
two trials to reach criterion. It is sur- 
prising that both Jung in the present 
paper and other investigators have over- 
looked this definition of overlearning. If 
enough overlearning is introduced, even 
the A-Br condition should yield the ex- 
pected results. 

What predicted effects can be derived 
from an overlearning-structure position? 
I am partly at fault if there is any misun- 
derstanding. But it should be fairly clear 
that overlearning of A-B should produce 
less negative transfer and probably zero 
transfer—with the appropriate controls— 
in the A-B, A-C paradigm. This point 
was made in my earlier paper on this 
problem: “The present analysis therefore 
predicts less interference with high de- 
grees of learning [Mandler, 1954, p. 241]” 
as well as in 1962: “When... negative 
effects change to zero or positive effects 
. . . it is appropriate to invoke some 
[other than associative] hypothesis [p. 
425].” The use of the term “posi- 
tive" in the last quotation is роѕ- 
sibly incorrect. All that is implied is 
that there should be a reduction in nega- 
tive transfer following high degrees of 
overlearning of the A-B list. Thus, the 
studies by Jung (1962) and Postman 
(1962) both supply positive evidence for 
this prediction. Spence’s study (1963) 
is, at worst, neutral in this regard. The 
structural position predicts that a first 
list of strong associates should interfere 
less with the second list than a first list 
of weak associates would. In fact, 
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Spence’s data show 7% and 21% negative 
transfer for two different lists, which at 
least in the first case is the predicted low 
degree of negative transfer. 

One additional question needs to be 
clarified: What is it that is overlearned? 
Many different structures „тау be ac- 
quired, and, for example, either the spe- 
cific response or a set of responses may 
be overlearned. In the latter case, the 
structure is the structure of all responses 
in a list; the subject can think of the list 
as a unitary set. This will be an impor- 


tant point when we return to a discussion 


ut for the time being 
I want to apply this distinction to the 
problem of meaningfulness, It may be the 
case that, when we are dealing with pre- 
viously overlearned, integrated, discrete 
Tesponses—as all high-M words are—the 
Subject is faced with 
Similar integrated ге 
which occur in the A- 
The individual items h 
learned and are struct 


of differentiation, b 


sponses 


Set of meaningful Words will continue to 
Occur. With low i 


yet-to-be-integrate 
ject may need o 


turally available В 
C response, 


Differentiation Is Structure 


Jung claims that res 
response availability, and response dif- 
ferentiation are adequate to account for 
the decrease in negative transfer follow- 
ing overlearning. I have no arguments, 
of course, about the first two factors 
(cf. Mandler, 1954). And even a cur- 
sory examination of the notion of dif. 


ponse competition, 
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ferentiation in the two papers that have 
invoked it to account for the са. 
effect (Jung, 1965; Postman, pen 
reveals that it is essentially identical with 
my nonassociative concept of structure. 
From the evidence presented it is obvi- 
ously not what Jung claims it to be; an as- 
sociative notion, В 

First let me clear away some confusion 
in Jung’s description of differentiation. 
He says that “as the amount of practice 
on one list relative to that on the other 
increases, differentiation may be said i 
occur [p. 320]." Does Jung really wan 
to maintain that differentiation. o 
a U-shape function as the learning of t а 
second list in the А-В, А-С paradigm pro 
ceeds? Obviously his statement нари 
that on the first trial of the A-C list di е 
ferentiation between the two lists ge 
be large; practice on A-B relative C 
A-C is large. As learning on the А- 
list continues, differentiation will “ee 
decline and finally rise again. I do Md 
quite know what predictions one WoR 
derive from this deduction; they might i 
interesting, though Jung does not make 
them. However, Postman (1962) im- 
plies that differentiation does not occur 
until overlearning on the A-B list rus 
taken place, Note that this is what en 
development of structures is about. Bu 
obviously Something is needed to explain 
the high availability of the B responses 
following overlearning. 

In a recent experiment in our DE 
tory (McMillan, 1964), we found tha 
modified free recall of overlearned 
items was at about the 92% level eel 
ing second-list learning, while the B recal 
when the first list was learned only to cri- 
terion was about 6095, and yet there Aur 
—admittedly to our dismay— practically 
no differences between the overlearne 
and the criterion groups in the mean num- 

er of trials to criterion of learning the 


-C list. In fact the overlearned L 
was slightly though not significantly 
better, 


In other words, and as we iod 
pected, response availability of the В i 
Sponse—as measured by the modified пе 
recall technique—does not predict amount 
of negative transfer with overlearned us 
terial. In order to handle this speci®c 
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Problem, | Jung (1965) and Postman 
(1962) introduce the concept of dif- 


саны But how does a subject 
ifferentiate list membership? When 
Presented a particular stimulus item, 


he tends to make one of two responses; 
the B response or the C response. If 
the two lists are “differentiated,” he is 
a likely to make the C response, if 
d are not "differentiated," he is more 
iod to give the B response. But this 
E olves a complex operation and is not 
iei i el phenomenon in the usual 
io ы the term. As I understand as- 
Bela 6 phenomena such аѕ геѕропѕе 
do ene and response availability, they 
dm require operations on or manipu- 
ties d of the materials by the subject, 
Жашы ee involve cognitive transforma- 
ferenti о say that the subject can dif- 
сап mei list membership means that he 
терра ke а choice between two possible 
the em and he makes that choice on 
Guise of a covert operation on the 
simpl к available responses. More 
cidis = the subject thinks and then de- 
should hat this is the second list, and he 
is Ke be giving a first-list response. 
referir what I am talking about when 
Struct S to the development of cognitive 
aa. es: We have suggested that over- 
and he uw with the old response 
ture B ormation of an analogic struc- 
respone Ius the subject to manipulate that 
out maid to 'think about' the problem with- 
p. 425] ing overt errors [Mandler, 1962, 
Statement I see no conflict between this 
беп ies the following quotation from 
entiation (1962, p. 116) about the differ- 
Хур process: With highly integrated 
jection Fr pom the discrimination and re- 
Speed А» errors may occur with sufficient 
Correct , permit the substitution of the 
interval тезропве during the anticipation 
ative Я Rejection’ is not an associ- 
erenti nechanism, and thus neither is dif- 
ales ation which, according to Postman, 
With LS possible. Iam concerned 
о tak е mechanism that permits rejection 
Tepresc; place, and we need structural 
Such ntations of some sort to perform 
B fr operation. Overlearning of an 
ist apparently both preserves the 
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A-B “bond” and makes it possible for the 
subject to manipulate the B responses. 

Let me add some further evidence from 
the experiment by McMillan (1964). She 
instructed subjects to write down the first- 
and second-list responses as they came to 
mind, ie, in Positions 1 and 2. Thus, 
order of recall scores were available, and 
it was found that out of 8 high-fre- 
quency responses, i.e., from 8 pairs, 
subjects who learned the first list to cri- 
terion gave 1.8 B responses and 5.8 C 
responses in Position 1, while subjects 
who had overlearned the A-B list for 15 
trials gave 5.6 B responses and 24 C 
responses in Position 1. Despite the fact 
that there were no significant differences 
in negative transfer, either in terms of 
trials to criterion or number of correct 
responses, the overlearned subjects had 
been able to maintain and remember the 
A-B "bond" at high "strength." Thus, 
even strongly maintained associations be- 
tween A and B, associations (not just 
availabilities) that survive A-C learning, 
can be manipulated and produce no more 
negative transfer than relatively weak 
A-B "bonds" that do undergo unlearning 
during second-list learning. 

Finally, I would like to refer Jung to 
the literature on the effect of anticipa- 
tion intervals in the A-B, A-C paradigm. 
Nodine (1963) and others have shown 
that negative transfer decreases as а func- 
tion of the anticipation interval employed. 
If one accepts the notion that in the A-B, 
A-C paradigm the subject must suppress 
the B response during second-list learn- 
ing, then an increase in the length of the 
anticipation interval would reduce the 
time pressure and make suppression of 
the inappropriate first-list response easier. 
Similarly, first-list overlearning makes 
such suppression easier. In other words, 
either long anticipation intervals where 
the subject can think about the appropri- 
ate response, Or overlearning, where the 
subject has the inappropriate response 
readily available and readily suppressible, 
produce less negative transfer. — 

Postman (1962) and, by implication, 
Jung invoke “differentiation” to explain 
how a subject rejects (or suppresses) an 
incorrect response. That is the central 
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problem, and it cannot be solved by in- 
troducing terms that sound better than 
others in order to salvage the consistency 
of a particular theoretical school. I think 
we should use concepts of whatever par- 
entage and not be particularly concerned 
with such salvage. The use of the term 
“structure” suggests that the subject is 
able to manipulate several 


Tesponses 
available at the same time. The study of 
human verbal learning js part of the 


problem of thinking, 
concerned how subjec 
late covert Tesponses, how they identify 
them, reject them, avoid interference, how 
—in short—thinking Proceeds, 

I believe that the invocation of “dif- 
ferentiation” js additional support for a 
nonassociative explanation of the effect of 


and we should be 
ts learn to manipu- 


ta frequently used con- 
-associate ex- 
of overlearn- 


THEORETICAL Notes 


foolhardy. Subjects do think, even when 
learning paired-associate lists, 
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PSYCHOLOGICAL REVIEW 


TRANSFER OF VERBAL PAIRED ASSOCIATES* 


EDWIN MARTIN? 


University of Iowa 


From the current literature it is possible to identify 2 processes 
underlying the acquisition of verbal paired associates: response learn- 
ing and association formation. It is also apparent from the litera- 
ture that a complete treatment of association formation must take 
into account association directionality. Altogether, then, 3 "things" are 
seen to evolve during the acquisition of a paired-associate list: re- 
sponse availability, forward associations, and backward associations. 
The thesis of the present research is that what is transferred from 
the Ist to the 2nd task in a paired-associate transfer situation is some 
combination of these 3 effects. Utilizing the coordinate system in- 
vented by Osgood in which all the transfer paradigms can be ar- 
ranged, 3 transfer surfaces are proposed which describe how each of 
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the 3 effects is transferred individually. 
It is found that the principal results 


transfer literature are made. 


Applications to extant 


of nearly all experiments utilizing the A-B, C-D control paradigm 


can be accounted for. 


е... Problem of transfer of verbal 
Mur associates originated with 
bin ег and Schumann (1894), whose 
ien of associative inhibition was essen- 
fans Specification of the negative 
ML paradigm A-B, A-D, and 

Uller and Pilzecker (1900), who de- 


ior d article derives in part from a doc- 
lege Issertation submitted to Graduate Col- 
fu], D niversity of Iowa. The author grate- 
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Hungonttact No. AF 49(638) 1235 with the 
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veloped the method of "right associ- 
ates." А closely related problem, one 
with which transfer is, in many re- 
spects, inextricably intertwined, is that 
of retroactive inhibition, which also 
originated with Müller and Pilzecker 
(1900) in conjunction with their 
perseveration theory of reproduction 
inhibition. In 1949, Osgood attempted 
to organize the many facts and insights 
that had been accruing in this general 
problem area since Müller's time by 
proposing a transfer and retroaction 
surface. Thus the period from Müller 
to Osgood is a historical package, so 
to speak, with the most articulate sum- 
mary being Osgood's paper. The 
purpose of the present paper is a re- 
organization based on additional facts 
and insights, some of which have ac- 
cumulated since 1949, with the restric- 


327 


328 EpwiN 
tion that only transfer problems will be 
ce а of the effects of having 
learned one set of paired associates on 
the subsequent learning of another, 
four experimentally controllable "o 
ables emerge for which there are suffi- 
cient data to draw definitive conclu- 
sions. Two of these, interlist stimulus 
and interlist response similarity, have 
received considerable attention and 
were incorporated into Osgood’s 

(1949) transfer theory. The other 
two, degree of first-task learning (L) 
and response meaningfulness (M), 
haye been treated experimentally but 
not explained theoretically. To grasp 
the empirical relationships among these 
four variables, it seems best to first 
examine Osgood’s theory and then to 
describe how L, and M un 
tematic departures, 
Osgood’s (1949) 
transfer theory can be 
into two components : ( 


of a coordinate system in which all 
paired-associate paradigms can be ar- 
ranged, and (b) a summary of avail- 
able data in the form of à surface which 
describes how amount of transfer and 
position in the Coordinate system are 
related? The Coordinate system has 


3At approximately thi 
parently independently, 
made an equivalent pro; 
Same data, plus expectati 
tion theory, in the fo 

is curves are essential 
given by Osgood. 


derlie sys- 


contribution to 
seen to resolve 
а) the invention 


€ same time, and ap- 
Underwood (1949) 


posal based on the 
ons from generaliza- 
rm of four curves, 
Пу the surface edges 


Fic. 1. Component transfer surfaces, 
response availability, 


Martin 


the important feature of pee, m 
tinguishing the separate roles en 
terlist stimulus and mema cs dios 
dissimilarity. These two varia m *. 
expressed as orthogonal axes, E are 
and Xp axes of Figure 1F, w od 
origin representing complete ees a he 
or identity. Thus, for oie kei 
negative transfer paradigm A- ^1 Xn 
has the coordinates Xs = 0 poe 
some sufficiently large value to in ated- 
complete dissimilarity (or par: cor 
ness) between the responses О re 

first- and transfer-task lists. All T i 

fer paradigms are assigned a bow diu 
position in joint accordance wit. 
dissimilarity between the stimuli E 
the dissimilarity between ш, е 
of the two lists. Surface points ne we 
the Xs-Xp plane represent ene 
transfer; points below, пева 
transfer. 

A matter to be taken into n : 
before proceeding further is that o tas 
so-called “opposed” relationship. af 
tween the two tasks. With ime 
identical, for example, the responses fe 
the first and transfer tasks cani mi 
identical (A-B, A-B paradigm), D) 
lar (A-B, A-B^), neutral (A-B, A- E 
Ог opposed. A number of пова 
tors, including Osgood, have used xe 
last condition. As an example of i 
OPPosition relationship, sickly is i] 
lar to pale, but healthy is opposi 
Usually, response opposition prodot 
less negative transfer than the “et: 
ity relationship of the A-B, 


C" ———————— - 
а "V.-—— ———— АШ 
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paradigm (eg. Bugelski & Cadwal- 
lader, 1956; Wimer, 1964). 

If two items are judged opposite in 
meaning, they must perforce be re- 
lated; otherwise, opposition could not 
be established. Generally speaking. 
the axes of Osgood's coordinate sys- 
tem are seen as representing similarity 
in meaning. However, it would be 
more consonant with the present status 
of interference theory to suppose that 
the axes represent associative related- 
ness. Similarity would clearly be in- 
cluded ; but, more importantly, oppo- 
sition would thereby be placed at points 
on the axes between identity and com- 

Plete dissimilarity instead of beyond 
E complete dissimilarity point. To 
ol recognize opposition as à category 
ae edness achieves two reasonable 
^ ig i (a) It places data points 
aea from opposition relationships 
the А ееп Tasks 1 and 2) in an area of 
is & coordinate system where transfer 
"d a to be less than maximally 
"a ive. In the experiment by Bugel- 
Lo Cadwallader (1956), the re- 
ea dal condition would thus 
A-B ae between the identical (A-B, 
ак I similar (A-B, A-B') condi- 
os but closer to the similar condi- 
diee ^ the Wimer (1964) experi- 
К = response-opposition condition 
A-B) all between the similar (A-B, 
or and unrelated (A-B, A-D) con- 
link s. (b) It would more completely 
verb interference interpretations „of 
al learning to the transfer situation 

AA removing the restriction that the 
S aires: represented by the 
ity tdinate axes is that of strict similar- 
basi Associative relatedness 1s the 
а for both synonymity and an- 
ymity. There can thus be no 
ee on the dissimilarity axes be- 
Опа the completely unrelated para- 
Igms (А-В, С-В for stimuli; A-B, 
M for responses). This is some- 
Nat of a departure from Osgood’s 
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original conceptualization, but a de- 
parture which nevertheless seems 
necessary if thinking about what is 
transferred is to square with current 
thought on what is acquired when 
learning takes place. 

At positions where the suríace is 

either above or below the Xs-Xxr plane, 
positive or negative transfer, respec- 
tively, is supposed to obtain. A perusal 
of the transfer studies whose designs 
constitute a test of Osgood’s surface, 
however, reveals that although the 
general shape of the surface is fairly 
well substantiated, there are a number 
of systematic discrepancies which can 
be attributed to specific experimental 
factors, namely, L, and response M. 
(In the ensuing discussion, only ex- 
periments utilizing the A-B, C-D con- 
trol paradigm for nonspecific effects 
will be cited ; thus amount and sign of 
transfer at other positions in the coor- 
dinate system are always with respect 
to transfer-task performance at the 
A-B, C-D position.) 

The effect on transfer-task perform- 
ance of increasing L, is either to de- 
fer or to increase 
negative transfer, depending upon the 


position 
Bruce (1933) found that as L, in- 


experiment, transfer progress 
positive to negative in the A-B, C-B 
paradigm and became more negative in 
the A-B, A-D paradigm as L, in- 
creased from a 3/6 to a 6/6 + 5 cri- 
terion. Dean and Kausler (1964) 
have shown а striking shift from post 
tive transfer with a first-task criterion 
of Та 376 to negative transfer with 
I4 6/65 5 in the A-B, CB para- 
digm. Further substantiation, of this 
role of degree of L, is provided by 
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Spiker (1960) for the A-B, р par- 
adigm and by Postman (1962b) for 
several paradigms. " 
If overlearning of the first task is 
considered, the foregoing clear picture 
of the role of degree of L, becomes 
somewhat clouded. Postman (1962b), 
while reporting an increase in nega- 
tive transfer with a change from a 6/10 
to a 10/10 criterion for L, in both the 
A-B, C-B and A-B, A-D paradigms, 
finds a sharp reduction in amount of 
negative transfer for а 10/10 + 50% 
criterion. Mandler and Heinemann 
(1956) report negative transfer in the 
А-В, С-В paradigm after 10 errorless 
L, trials but positive transfer after 30 
and 50 such trials, with a return to 
negative transfer after 100 trials. In 
the A-B, A-D paradigm, transfer re- 
mains negative throughout but shows 
fluctuation similar to that in the A-B, 
C-B. These results are difficult to 
reconcile both with each other 
with data not involv 
Mandler (1962) has presented a Sys- 
tematic argument for negative trans- 
fer as a V-shaped function of degree of 
Тл, transfer becoming more negative 
then less negative in the A-B, A-D 
Paradigm. It should be noted, how- 
ever, that in making that argument, 
Mandler cites his own data (Mandler 
& Heinemann, 1956) but leaves out the 
data point for his greatest-degree-of- 
overlearning condition, a data point 
which denies the V-shaped function, 
Other data cited by Mandler in support 
of a V-shaped function are not relevant 
in the present context because they do 
not include A-B, C-D control para- 
digms for nonspecific effects, Thus, 
although degree of Is inversely affects 
transfer in the A-B, A-D and A-B, 
C-B paradigms in a regular manner at 
ordinary degrees of L,, a definitive 
conclusion regarding the role of over- 
learning does not seem possible at this 
point. 


and 
ing overlearning. 


Where M is taken as a generic ко 
referring to the соге attribute of ver x 
units reflected by such highly inter 
correlated scales as meaningfulness, 
association value, and pronunciability 
(Noble, 1963), the effect of first-list 
response M on transfer-task aa 
ance is to increasingly impede e 
performance as M is increased. Jung 
(1963) showed that when respon za 
are low-M trigrams, considerable po A 
tive transfer and barely negative tran d 
fer are observed in the A-B, рзи 
А-В, A-D paradigms respectively ; pe 
when responses are high-M p aie 
transfer in the A-B, C-B mee 
nearly zero, and transfer in the ner 
A-D becomes very negative. V TF 
M over three levels (low =C. a 
medium—CVCs, high—words), M " a 
kle and Battig (1963) showed т D 
M increases, transfer in the A-B, че 
paradigm decreases from barely nege 
tive to considerably negative. РР 

Further evidence on this matter mrt 
be had by categorizing available r3 
according to whether transfer is us 
tive or negative in the A-B, C-B pa d 
digm. If this is done, it turns E 
that all studies yielding positive ed 
fer involve trigrams as respon & 
(Bruce, 1933; Jung, 1963; Meier n 
Heinemann, 1956), whereas. DE 
yielding negative transfer pers 
meaningful responses, such as 21953; 
tives (Dallet, 1962; Harcum, n 
Kausler & Капон, 1963; Keppel & Ur 
derwood, 1962; Twedt & шегу, 
1959; Wimer, 1964). Ап exe н 
to this categorization is the Б ich 
and Cadwallader (1956) study, ks : 
Teports positive transfer at the a 

-B position with words as peint é 
materials. Their procedure io eit 
dropping out pairs from the gener 
list once given correctly twice in S 
Cession. ^ Further, they used vis 
patterns as stimuli. eM 

Thus degree of L, and respons 


- — йй 
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appear to be major experimental fac- 
tors responsible for systematic devia- 
tions from Osgood’s surface. There 
are, of course, other factors whose ef- 
fects will eventually have to be ac- 
counted for, for example, intralist 
similarity, list length, and time be- 
tween L, and the transfer task. At 
the present time, little is known about 
nm i" of the first two on the trans- 
| ask ; and, adhering to the restric- 
on of considering only transfer data 

which includes the A-B, C-D control, 
on one study, Newton and Wickens 
PUB) „is known to the present writer 
ся rning the last. If one is willing 
а ке percent saving (the percent 
Pe de erence in the number of trials 
с iterion for the two tasks is of the 
Pod for the first task) in the A-B, 
s paradigm (their Experiment I) 
CD percentage of saving in the A-B, 
it rises (their Experiment П), 
ined b out that the differences ob- 
tie f y subtracting the latter from 
E^ E for intertask delays of 0, 
Thus f 48 hours is —26, -8, and +5. 
is a the A-B, A-D paradigm there 
ge ence that the associations which 
еф: interfere during transfer-task 
anda” are lost over time. | The 
liens customarily cited on this phe- 
Slop тзв, however, either do not ет- 
a A-B, C-D control for non- 
1938) effects (Bunch & McCraven, 
effects or deal only with the nonspecific 
Hari of the A-B, C-D paradigm 

amilton, 1950). 

a шш, while the coordinate sys- 
pipa efinitional in nature, the surface 
on sed by Osgood is an empirical 
view = induction from the facts. In 
кы of the growing complexity of the 
ofl as revealed, at least, by the roles 
е нан M, however. a. revised induc- 
eid = not seem feasible; instead it 
impli appear easier to turn to the 
"i cations embedded in already 
ant verba] learning theory for an 
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improved formalization of transfer 
phenomena. On this view, two devel- 
opments are of special interest: the 
two-phase conceptualization of verbal 
learning and the notion of association 
bidirectionality. 

The idea that verbal learning may 
profitably be viewed as more than 
one process seems to have originated 
with Thorndike (1932), who notes 
that “other things being equal, connec- 
tions are easy to form in proportion 
as the response is available [p. 343]” 
and that “much of learning consists in 
making certain responses more avail- 
able [p. 347]." Current expression of 
this notion is well exemplified by 
Underwood and Schulz (1960) : 
“Logically speaking, the acquisition of 
a serial or paired-associate list can be 
divided into two stages. The first will 
be called responsc-learning or response- 
recall stage. Tt occurs temporally 
prior to the second stage which will 
be called the associative or hook-up 
stage [p. 92].” 

The utility of distinguishing two 
phases of the total verbal learning 
process is readily discernible in the 
literature. For example, Underwood, 
Runquist, and Schulz (1959), Horo- 
witz (1961), and Carterette (1963) 
have demonstrated that the two phases 
can be discriminated experimentally on 
the grounds that a single variable 
differentially affects certain response 
measures separately identifiable with 
the two phases. It was found that 


while variations in intralist response 


had a direct effect on re- 


similarity 
sponse learning (a5 measured Бу 
recallability). association formation (as 


responding to 
or by ordering 
ersely affected.* 


measured either by 
fic stimulation 


speci $ 
) was mv 


performance 


4 Caution must be exercised : in claiming 
that high intralist response similarity facili- 
tates response learning. When similarity 
among items is produced by means of either 
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A more detailed consideration of these 
two components of learning, plus 
others, has been given by McGuire 
(1961) : He argues cogently for their 
utility and demonstrates experimentally 
their distinguishability. 

Since it is meaningful to view the 
acquisition of verbal paired associates 
as a composite of two processes, re- 
sponse learning and association forma- 
tion, it makes sense to conclude that at 
the end of L, in a transfer situation 
there are at least two “things” available 
for transfer to the second, or transfer, 


task; namely, response availability and 
associative connections, 


being the product of first-t 
learning, the latter of firs 
ation formation, 
Admitting that associ 
tions are available for 
task to task entails a co 
the rather extensive lit 
ward associations, 
problem apparently 
binghaus (1885) 
a result of the le 


the former 
ask response 
t-task associ- 


ative connec- 
transfer from 
nsideration of 
erature on back- 
Cognizance of this 
begins with Eb- 
» Who wrote that “as 
arning of a series cer- 


(a) generating a list of trigrams from a 
limited alphab, 


ists that given 


others can be inferred, In such Situations, it 
becomes problematic whether Observed fa- 


It is not reasonable 
Sponse learning is the sole source of the 
Tesponse availability measured by recall, 
Strictly speaking, response learning js the 


serial acquisition of the elements 


to suppose that re- 


ever, involves also 


tions that support recall, Thus the recall 
measure is not a pure Measure of response 
learning. In the transfer Situation, context 
associations which tend to elicit responses 
are indistinguishable from specific transferred 
associations, hence in future references to 
response availability what is intended is 
availability arising from response learning 
only. 
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tain connections of the members are 
therefore actually formed in a reverse 
as well as in a forward direction [p. 
112].” Most of the early interest in 
backward associations was centered on 
the problem of remote associations in 
serial learning, the first application к 
transfer apparently not occurring unti 
the appearance of Harcum's paper in 
1953.  Harcum's finding that back- 
Ward associations are transferred has 
Since been confirmed by Murdock 
(1958) with different materials. mE 
further evidence provided by Keppe 
and Underwood (1962); hence a gen- 
crality seems to be in order, The con- 
clusion, then, is that backward gus 
tive connections are a component o 
what is transferred in a transfer situa- 
tion and must be included in a formali- 
zation of transfer phenomena. 


А COMPONENT-SURFACES 
CoNcEPTUALIZATION 


It has been argued that there “е 
three components to that which: 15 
transferred from one paired-associor 
task to another, namely, response ауа!" 
ability, forward associations, and back- 
ward associations, An immediate ye 
Plication of this position is that all o 
the ordinary measures of transfer 
(number of trials to criterion on the 
transfer task; number of correct rê- 
Sponses, or errors, in so many a 
task trials: etc.) reflect what is bes 
called а net effect. Tn other ward а 
single transfer surface whose ordina 
values are given by any one of a 
standard measures is essentially a T 
transfer surface and presumably can a 
analyzed into other, more specific pr 
ponent surfaces. This means аг, 
Since there are three effects transferrec» 
there should be three transfer << a 
cach describing how its particular ef 
fect is transferred as a function of s 
tion in the coordinate system. Th 
Specification of such a triad of surfaces: 
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at least schematically, follows from 
general paired-associate interference 
theory. 

Consider first how response avail- 
ability (that due to response learning) 
transfers from one paired-associate task 
to another. Any response learning 
evolved during the first task must 
transfer maximally to the transfer task 
of paradigms at positions in the coordi- 
nate system where first- and transfer- 
task responses are identical. ^ This 
sian is not perfect to the extent that 

sponse availability includes context 
а other nonparadigmatic associa- 
най. carrying over; in fact, such 
ds EE may even be a source of 
à rference. Response availability 
[e to first-task response learning. 

owever, has а maximum and equal 
реше effect along the Xs axis, that 
Bs at all positions from A-B, A-B to 
A C-B. At the other extreme, none 
fask ie response learning of the first 
of 1s appropriate to the transfer task 
DI. gem involving completely 
nici oe Response 
ы ability acquired extraexperimen- 
md may certainly apply but not 
di availability from first-task 
Mr a a Therefore, a surface repre- 
зона) the _transfer of first-task re- 
X X earning must coincide with the 
mi к zero-transfer plane at positions 

p» A-B, A-D to A-B, C-D. 
шы aiding positions between the 

xg y positive-transfer loci from 
vor ‚ А-В to A-B, C-B and the zero- 

е loci from А-В, A-D to A-B, 
avail ager amounts of response 
Че ability obtain. These varyms 
oe of transferred response avail- 
appli. are seen as arising from the 
wu leability of first-task response 
Enn to whatever characteristics Or 
к Ponents of the transfer-task re- 
$^ ses are responsible for the similar- 
be (relatedness, including opposition) 

tween the first- and transfer-task 
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responses. This argument can be 
summarized by a hypothetical surface, 
called the R surface for response avail- 
ability, and is shown in Figure 1R. 

The second component of paired- 
associate acquisition, forward associa- 
tion, requires some degree of similarity 
between first- and transfer-task stimuli 
in order to induce a nonzero effect. 
If associations A-B are acquired in the 
first task, then to the extent to which 
the stimuli of the transfer task are 
similar to A, the associates of A from 
the first task are elicited in transfer- 
task learning. Thus, those positions 
on a line between A-B, C-B and A-B, 
C-D cannot involve transfer of forward 
associations because completely new 
stimuli characterize the transfer task. 
In progressing from the A-B, C-B 
position toward the A-B, A-B position 
in the coordinate system, first-task 
forward associations become increas- 
ingly more useful in the transfer task, 
until the A-B, A-B position is reached 
where the transfer task is but a con- 
tinuation of the first task. 

As the A-B, A-D position is ap- 
proached (from any direction), how- 
ever, the forward associations acquired 
in the first task become increasingly 
stronger sources of interference in 
transfer-task learning. This interfer- 
ence is inferred from the fact that 
acquisition of A-D associations involves 
the extinction of А-В associations 
(Barnes & Underwood, 1959; Briggs, 
1954; Postman, 1961, 1962a). As will 
be pointed out below, the literature is 
unanimous that relative to the A-B, 
C-D control paradigm the A-B, A-D 
yields negative transfer. The point to 
be made here is that this negativity 
cannot be due to either transferred 
response availability ог transferred 
backward associations (because the re- 
the transfer task are com- 
) and hence must be due to 
ward associations. The 


sponses of 
pletely new 
interfering for 
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F surface, depicting the transfer of 
forward associations, is shown in Fig- 
ure 1F. It is essentially the surface 
proposed by Osgood (1949). 

The third, and final, component to 
be considered is that of backward as- 
sociation. The surface to represent 
this effect must necessarily be sym- 
metric in general form, if not in mag- 
nitude, with the F surface where the 
axis of symmetry is the line between 
the A-B, A-B and A-B, C-D positions. 
Arguing in the reverse direction, there 
must be some degree of similarity (re- 
latedness, including Opposition) þe- 
tween the responses of the first and 
transfer tasks in order for backward 
associations acquired in the first task 
to be elicited in the transfer task. Now 
it has already been pointed out that 
backward associations are a component 
of what is acquired in paired-associate 
learning; hence if, as in the A-B, C-B 
paradigm, the backward B-A associa- 
tions of the first task are elicited during 
transfer-task learning, then the acquisi- 
tion of the backward B-C associations 
will be impeded, Thus the Surface 
Tepresenting the transfer of backward 
associations must be negative at the 
A-B, C-B position. At the A-B, A-B 
position, however, the backward asso. 
ciations to be acquired in the transfer 
task are the Same as those already 
practiced in the first task, hence the 
surface there must be positive, On a 
line between A-B, A-D and A-B, C-D, 
the responses are completely different 
in the transfer task, that is, the stimuli 
capable of eliciting the backward asso. 
ciations of the first task are absent, 
hence the surface must coincide with 
ће Xs-Xp zero-transfer Plane. Such 
a surface is shown in Figure ІВ, 

The three surfaces of Figure 1 are 
not entirely empirical (as was Osgood’s 
single surface) because the three con- 
cepts represented by them are indeed 

primarily concepts. The surfaces are 
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intended to be representative of expec- 
tations as to the nature of the com- 
Ponents of the two-task paired-associ- 
ate situation (transfer) arising from 
the general interference theory of the 
single-task paired-associate situation. р 

The locus of action of the experi- 
mental variables L, and M can now be 
seen somewhat more clearly. In the 
degenerate case where L, — 0, that is, 
where no first-task learning obtains, all 
surfaces collapse into the Xs-Xpr zero- 
transfer plane: Nothing is transferred. 
With successive first-task trials, the as- 
sociations which are to transfer become 
stronger, and the F and B surfaces 
assume their characteristic curvature. 
For example, at the A-B, C-B position, 
as L, increases, the negativeness of 
the B surface increases, presumably 
asymptotically, until a maximum 1s 
reached (see earlier discussion of over- 
learning). — With respect to the R 
Surface, L, trials Serve also to develop 
its curvature, In situations where 
first-task Tesponses are of the highest 
M (ер. familiar nouns), only a few 
trials are required for full development 
of the R surface. Thus L, and M 
enter the analysis as factors determin- 
ing the developmental (over trials) dis- 
Positions of the surfaces relative to 
the Х-ХҺ ero-transfer plane. These 
relationships will emerge more clearly 
when applications of the model are 
discussed below. 

Each Surface is thus a different func- 
tion of the same two variables, Xs and 
Ап. For example, the R surface 


would be described by some function. 
fas 


R= f(Xs, Xn; M, L,), 


Where M anq L, are parameters rep- 
Tesenting the response-M and degres: 
of-L, characteristics, respectively, © 
the transfer Situation. Tf response M 
Say, Were to be studied functionally at 
Some fixed position in the coordinate 
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system, then resulting variations in 
Tesponse availability would be repre- 
sented by 


R=f(M; Xs, Xr, L4), 


Where f is the same function as before, 
aa M is now taken as the variable and 
ѕ Xm, and L, as the parameters. 
Thus there are multiple experimental 
approaches to the ultimate specification 
of the three functions relating the com- 
ponent surfaces to Xs and Xr. The 
most formidable obstacle between the 
UR state of transfer theory and 
abil a goal, however, is the unavail- 
Чу of a suitable similarity (related- 
E. measure for the Xs and Xp 
ieee If such a measure cannot 
: evolved, then the Osgood coordinate- 
E view of transfer phenomena 
ot develop into a quantitative 
theory,5 
Au decomposition of transfer phe- 
fren d into component effects has 
Hm ш consideration by Post- 
Sine 62b). He distinguishes four 
aes of transfer: (a) learning to 
fects. and warm-up, or nonspecific ef- 
нае, (b) response learning, which is 
Presented by the R surface in the 
ml formulation ; (c) associative in- 
Subdividel which has here been formally 
fient * ed into the F and B compo- 
бейле urfaces ; and (d) differentiation 
fer i lists, a characteristic of trans- 
a ks represented by the Xs and Xn 
volve assigned to the paradigm in- 
- Regarding the last source, if 
s and Xy are to adequately represent 
nea of interest to note that in a о 
generate for ae oup EE un 
sts (1963) presented by ae 2 d 
Which E» )a statement P e aem 
relation 109 But in senere а еи age 
Symmetric . pe Bes a unt ted 
similar to ab than ab [e to abc. If this is in 
act the a an ab is to Lu E ee 
Which list eed x eee qe К in ‘transfer 
Problem, | goes into which ask in a in 
Я ‚ hence introducing directionality 


1s 2 ft 
Similarity measure. 
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differentiation between lists, they must 
necessarily include more than scaled 
dissimilarity. Undoubtedly, Postman 
means to include the type of differentia- 
tion characterized by increased cer- 
tainty as to which of the two tasks a 
given response item belongs. Postman 
draws further attention to the fact 
that second-task learning also has two 
phases and that the several effects 
being transferred from the first task 
should be expected to differentially 
affect them. : 

Before proceeding to applications of 
the component-surfaces conceptualiza- 
tion to specific transfer situations, it is 
necessary to take one more step on the 
theoretical level: In addition to the 
defined coordinate system and the as- 
sumed component surfaces, there needs 
to be a statement about the temporal 
relationship between the two phases of 
learning. The role of such a statement 
would be to provide a conceptual 
mechanism by means of which the 
parameters of the transfer problem can 
specify dispositions of the component 
surfaces relative to cach other. For 
example, suppose it were assumed that 
the response-learning phase precedes 
the association-formation phase; then 
for a sufficiently low degree of L, very 
little association formation could occur. 
Therefore, at a position such as the 
A-B, C-B where the sign of transfer 
depends upon the net effect of the 
positive R and the negative B effects 
(see Figure 1), positive transfer would 
be predicted because no B effects 
could have developed. But this is an 
example of a successful application of 
the complete component-surfaces con- 
ceptualization to actual data and hence 
is putting the cart before the horse, 2 
to speak. 1f the original plan of attack 
is to be adhered to, a statement about 
the temporal relationship between the 
two phases should be evolved outside 


of transfer theory. 
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The facts insist that there is no clear 
temporal distinction between the re- 
sponse-learning and association-forma- 
tion phases in verbal acquisition, Three 
studies of particular interest on this 
score are those of Peterson and Peter- 
son (1959, Experiment II), Crothers 
(1962), and McGuire ( 1961). Peter- 
son and Peterson studied the short- 
term retention of consonant syllables 
and discovered that the dependent 
probabilities of subsequent letters being 
correct, given that the preceding letters 
were correct, increased with the num- 
ber of repetitions before recall. In 
other words, parts of a unit may be- 
come available before the unit is avail- 
able as a whole, Crothers, working 
in a paired-associate situation, found 
that when response items comprise 
more than one component, association 
formation may begin between the 
Stimulus and whatever components are 
available before the entire response is 
available as a unit, This conclusion 
for the paired-associate Situation finds 
further exemplification in the work of 
McGuire. The import of these studies 
is: Although response learning and as. 
sociation formation may be effectively 
distinguished, they nevertheless Over- 
lap; for association formation may be- 
gin as soon as at least one of the 
components of the Tesponse item is 
available, 

Certainly the enti 
must be available be 
sponse can be accred 
association formatio 
responding has bec 


these two arguments Say nothing about 
when, relative to each other, the bulks 
of the two processes occur, An ade- 
quate analysis of this problem requires 
first an agreement as to what is meant 
by “the bulks of.” Pending such a 
step, it might be noted in the Under- 
wood, Runquist, and Schulz (1959) 
data that whereas the effects of intralist 


Te response unit 
fore a correct re- 
ited, and certainly 
n continues after 
ome correct; but 
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response similarity on recall of = 
response items disappear after the А 
half-dozen trials, the effects on overa 
learning carry on to the end, an indica- 
tion that response learning may be a 
more rapid process than association 
formation in some situations. i 
theoretical purposes, it will be assume 
that, in general, response learning pre- 
cedes, but overlaps, association i 
lation. Vindication of this step wil 
emerge as applications are considered. 
The coordinate system, the compo- 
nent surfaces, and the assumption just 
made comprise a clearly asymmetric 
formulation: No mention has been 
made of stimulus learning or par re 
tion. In view of the limited role 9 
experimentally manipulated stimulus 
availability in Subsequent paired-asso- 
ciate learning (eg, Schulz & cape 
1964; Schulz & Tucker, 1962), an 
because of the small effect of stimulus 
M relative to response M (e.g., Under- 
wood & Schulz, 1960), it does not 
seem profitable at the present time to 
develop an accommodating structure 
within transfer theory. That pia 
stimulus factors will prove to be О 
considerable importance in piede 
sociate learning is clear from the van 
of McGuire (1961) and Shepar 
; however, the implications for 
transfer situations are not yet clear. 


APPLICATIONS 


In order to see how the foregoing 
Proposed conceptualization applies to 
extant data and generates M 
Several of the more thoroughly studie 
Paradigms will be considered separately. 

A-B, C-D. First, it is clear п 
the analysis of the net surface into 
component surfaces leaves the A E 

-D position as the appropriate contro 
for all other positions; in fact, its 
naturalness as a contro] is emphasized. 
Since the transfer task involves new 
responses, none of the backward asso- 
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ciations formed in L, is elicited; nor is 
there any “head start" on transfer- 
task response learning since the re- 
sponse familiarization effected in L, 
1з completely irrelevant. Similarly, 
Since the transfer task involves new 
stimuli, none of the forward associa- 
tions formed in L, is elicited. On the 
other hand, the transfer task does bene- 
fit from the nonspecific transfer effects 
common to all the paradigms. Cer- 
tainly the measure of transfer used, 
and hence the choice of a control, de- 
pends upon the intent of the investi- 
gator; but if the specific effects of 
response availability and forward 
and backward associations are being 
studied, use of the A-B, C-D control 
seems imperative. 

A complication alluded to by Post- 
man (1962b) regarding the adequacy 
of the A-B, C-D position as a control 
when low-M material is used should be 
mentioned. Although not so stated by 
him, the cause for concern arises when 
the items are, say, consonant trigrams 
(CCCs), thus making it very difficult 
to satisfy the stipulation (required for 
the adequacy of the A-B, C-D position 
as a control) that first- and transfer- 
task stimuli and responses are com- 
pletely dissimilar. Due to the limited 
Соп of letters (21) from which 
"e can be formed, at least some 
i mal similarity between lists of any 
ength will obtain. 

F A-B, C-B. Of the three effects, R 

; and B, available for transfer at the 
end of L,, only R and B are actually 
transferred at the A-B, C-B position. 

ince the transfer task involves new 
stimuli, forward associations are not 
elicited, hence leaving in opposition the 
positive R and negative B effects. 
This much is based simply on the as- 
Sumed component surfaces. In order 
to form an expectation as to the cir- 
cumstances in which the net R-B effect 
Will be positive or negative, it is neces- 
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sary to evoke the assumption that re- 
sponse learning precedes association 
formation. Thus when degree of L, 
is sufficiently low, only response learn- 
ing will have occurred, to the exclusion 
of association formation, thereby allow- 
ing only R effects to evolve. Increas- 
ing the degree of L, permits the op- 
posing negative B effects to develop. 
As degree of L, goes from low to high, 
the net R-B effect shifts from positive 
to negative. In this way, the theory can 
account for the observed variations in 
transfer at the A-B, C-B position as a 
function of degree of L,. 

A similar use of the R and B com- 
ponent surfaces and the assumption re- 
garding the temporal order of response 
learning and association formation 
leads to an explanation of the role of 
response M in transfer at the A-B, 
C-B position. When low-M first-task 
responses are used for a given degree 
of L,, L, is primarily taken up with 
response learning and only secondarily 
with association formation ; but when 
high-M responses are used, such an 
extensive response-learning phase is 
not necessary, allowing more emphasis 
on the formation of the backward asso- 
ciations which will cause interference 
in the subsequent transfer task. Ап 
jncrease in response M, then, means an 
increased opportunity for the develop- 
ment of negative B effects, hence either 
a decrease in positive transfer or an 
increase in negative transfer, depend- 
ing on the given degree of Li: 


The foregoing theoretical treatment 


of the effects of degree of L, and level 


of response M at the A-B, C-B position 
is intended as an explanation of the 
data cited earlier as discrepant from 
Osgood’s single surface. Several pre- 
dictions for situations in which there 
are as yet no data can also be gener- 
ated. One of these has to do with the 
effect of variations in first-task intralist 


response similarity. Since it is well 
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known that high intralist response 
similarity tends to . impede overall 
paired-associate learning but to facili- 
tate response learning (e.g., Feldman 
& Underwood, 1957; Underwood, 
Runquist, & Schulz, 1959), decreasing 
amounts of negative transfer should be 
associated with increasing intralist re- 
sponse similarity. In comparing two 
groups, one with high and one with 
low first-task intralist response simi- 
larity, during L, the former will have 
evolved more Tesponse availability 
(stronger R effects) than the latter in 
equal numbers of trials. This is be- 
cause high similarity facilitates re- 
sponse learning. (In addition, the 
difference between these two groups 
should be larger for low-M response 
materials than for high.) Further, in 
the group with high intralist response 
similarity, acquisition of the associative 
connections (B effects) which will sub- 
sequently produce interference in the 
transfer task will be 
greater extent th 
low similarity, 
sponse similarity 
formation, 


© response M. For 


should be able to inde- 
pendently manipulate the R surface by 


such expediencies as varying the num. 
ber of "chunks" (Melton, 1963) in the 
first-task response members or admin- 
istering varying amounts of response 
familiarization prior to beginning the 
first task. Тһе consequences of such 
procedures would be differential em- 
phases on association formation during 
L, and hence variations in the net R-B 
effect. 
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A-B, A-D. At the A-B, A-D posi- 
tion, one sees from Figure 1 that 
neither R effects nor B effects are 
transferred, leaving only the interfer- 
ing forward associations acquired dur- 
ing L,. This unqualified expectation 
of negative transfer is unanimously 
borne out by studies using the A-B, 
C-D control. In order to explain the 
roles of degree of L, and level of re- 
sponse M at this position, it is again 
necessary to evoke the assumption that 
Tesponse learning precedes association 
formation. 

Variations in degree of L,, regard- 
less of position in the coordinate sys- 
tem, entail variations in degree of as- 
Sociation formation: Since response 
learning precedes association forma- 
tion, the time allotted to the formation 
of the forward associations which 
subsequently cause interference during 
transfer-task acquisition depends di- 
Tectly upon the amount of L, in excess 
of that required for the response-learn- 
ing phase. Thus as degree of L, in- 
creases, the amount of negative trans- 
fer at the A-B, A-D position should 
increase, 

A similar argument leads to the ex- 
Pectation of increased negative trans- 
fer with increased M of first-task re- 
Sponses, Although the R surface 15 
Inoperative at the A-B, A-D position, 
variations in response M differentially 
delay (inversely according to level of 

the association-formation phase of 

а. 

The foregoing derivations regarding 
degree of L, and level of response M 
are in complete accord with the data 
cited earlier, 

Tesult which can be accounted for 
11 terms of component surfaces but not 
by the single transfer surface is that of 
the Young and Underwood (1954) re- 
Sponse-predifferentiation study. They 
studied three Paradigms situated along 
the edge of the surface from A-B, 
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A-D to A-B, A-B. Halt of the sub- 
jects were given predifferentiation 
training on the response items of their 
first- and transfer-task lists by means of 
рея discrimination task, the other 
ialt receiving irrelevant predifferentia- 
tion training. Those in the latter cate- 
gory yielded the expected gradient of 
decreasing positive transfer with de- 
creasing interlist response similarity ; 
those receiving relevant response pre- 
differentiation yielded а markedly 
modified gradient, best represented by 
a horizontal line. In other words, 
ase response predifferentiation 
Mel ш the gradient. Now response 
Fen m can be seen to have 
т е ects in such a situation. First, 
ere repetition (during the verbal dis- 
er irr n task) of the items which 
the e sequently to be responses raises 
a x surface, that is, increases re- 
my availability, with the maximum 
So reer at the A-B, A-D posi- 
- econd, the Xp value for the 
priam involved is increased ; this is 
c ause predifferentiation serves to in- 
Tease the distinction between the re- 
к анд of the two lists. The net 
i ct of these two factors should be 
Fi ei for paradigms close to the 
the рын position since changes in 
R surface are less effective there 
e e at its maximum at A-B, A-B), 
ia ne m dropping F surface would 
Pind b out a considerable negative 
ots Pim the increase in Xr. On the 
Р г hand, the net effect оп paradigms 
in the A-B, A-D paradigm should 
e positive: Paradigms near there re- 
pics nearly the full effect of increased 
ure availability but only a negligi- 
negative effect due to increased Xn. 
ње 1 summarizes the signs of the 
components of the net transfer 
ect for the three paradigms dis- 
cussed, plus the А-В, А-В. Note that 
able 1 is a tabular restatement of 
Figure 1 for the four positions listed. 


TABLE 1 


DIRECTION OF COMPONENT 
TRANSFER EFFECTS 


Component effect 


Paradigm = 
R 


(Response (Forward (Backward 

availability sociations) | associations) 
A-B, A-B + + | + 
A-B, C-B + 0 - 
A-B, A-D 0 == 0 
А-В, C-D 0 0 0 


А-В, A-Br® Although the Osgood 
coordinate system is entirely adequate 
for providing a position for every 
paired-associate transfer paradigm, an 
examination of the A-B, A-Br para- 
digm leads to the conclusion that the 
component-surfaces formulation pro- 
posed in this paper cannot be applied 
to a certain class of paradigms of which 
the A-B, A-Br is a member, Three 
peculiarities of this paradigm seem of 
importance. First, because the exact 
same stimuli and responses appear in 
both tasks, it must be that, for this 
paradigm, Xs = Xa = 0, and hence 
that the A-B, A-Br is assigned the 
same position as the A-B, A-B. But 
the latter represents a maximum posi- 
tive-transfer situation while the former 
typically involves considerable negative 
transfer. This is because in the A-B, 
A-Br the transfer-list re-pairings re- 
quire transferred F and B effects to be 
negative. Therefore, in spite of the 
fact that the A-B, A-B and A-B, A-Br 
paradigms occupy the same position in 
the coordinate system, the same triad 


1 designates transfer situa- 
tions where the stimulus and response mem- 
sfer task are identical with 
t that they are ге- 
The notation was introduced by 
Porter and Duncan (1953), the first to study 
this paradigm using verbal materials. Gagné, 
Baker, and Foster (1950) discuss à similar 
situation in motor transfer problems. under 
the name oí complete reversal learning. 


6 This paradigm 
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of component surfaces cannot describe 
the transfer effects of both. Й 

Second, although each transfer-list 
stimulus has a new response, the re- 
sponses are not new in the sense that 
they are in the A-B, A-D paradigm ; 
and further, the old responses are still 
present somewhere in the list, not 
completely removed as in the A-B, A-D 
paradigm. To understand the import 
of such an arrangement, consider a 
single transfer-task pair where the 
stimulus member is the same as it 
was in the first task but where the re- 
sponse member is different, There 
are two kinds of response items which 
may be assigned: a completely new 
response item, that is, an item not 
used in the first task (1); or an old 
response item, one alrea 


dy used with 
a different stimulus in the first task 
(2). But both 


of these kinds of re- 
sponse items may pe assigned to the 
stimulus under either of the following 
two conditions: (а) where the original 
response item, the one that went with 
this stimulus in the first task, is com- 
pletely removed from the transfer list; 
or (b) where the original response item 
is retained elsewhere in the transfer 
ist as the Tesponse to some other stim- 
ulus. The four combinations, la, 1b, 
2a, and 2b, form a 2x2 table where 
across the top, say, is denoted which 
kind of response member the transfer- 
task pair has, new (1) or old (2), and 
down the side is denoted the transfer- 
task status of the original (first-task 
response, removed (a) or retained (b), 
This table is shown as Table 2. Con- 
ditions 1а and 2b are the usual A-B, 
A-D and A-B, A-Br Paradigms, re- 
spectively, 

As can be seen from 
fer effects listed in Table 2, the A-B, 
A-Br paradigm involves not only a 
Positive R effect and negative F and B 
effects, as would be expected from a 
simple consideration of the three com. 


the signed trans- 


TABLE 2 


BREAKDOWN OF TRANSFER EFFECTS 
IN THE А-В, A-D AND А-В, 
A-BR PARADIGMS 


Type of second-list response 
Status of original 
Tesponse 
dis New (1) Old (2) 
(A-B, A-D) 
R: 0 R: т 
Removed (a) | V: — Е: = 
B: 0 B: = 
(A-B, A-Br) 
R: 0 R: Y 
Retained (b) F: -,— F: a 
В: 0 B: 


ponent effects, but an additional nega- 
tive F effect. This is because the 
presence of old, first-task responses 
tends to elicit baclewardly the inter- 
fering forward associations acquired in 
the first task, thus causing these assoct- 
ations to be more resistant to extinc- 
tion^ An experiment utilizing essen- 
tially the design of Table 2 has been 
conducted by Besch, "Thompson, and 
Wetzel (1962) with results consonant 
with expectations based on the table. 
The third peculiarity attaching to the 
A-B, A-Br paradigm which has theo- 
retical significance is that transfer in 
an A-B, A-Br Situation must neces- 
Sarily be a list phenomenon: The re- 
pairing operation cannot be carried out 
on a single pair, Transfer theory as 
discussed in this paper prior to the 
treatment of the A-B, A-Br is in no 
Way contingent upon a distinction be 
ween transfer from a first to a secon 
list and transfer from a first to a second 
pair. А 
From the preceding observations, 
two queries come to mind in quick suc- 
7 That firstlist associations are extin- 
guished during second-list learning has been 
adequately established. For evidence am 


discussion on this point, see Briggs (1954); 


Barnes and Underwood (1959), and Post- 
man (1961, 19624). 
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nt First, since it is clear that the 
oted peculiarities of the A-B, A-Br 
ee arise from the operation of 
= iring, is it not plausible to sup- 
se that such an operation can define 
шше new set of transfer paradigms? 
is second, since a system of com- 
ponent surfaces clearly serves well to 
шек I transfer phenomena in un- 
a ise (ordinary) paradigms, is it 
ve pae to wonder if an equally 
bed eable conceptualization might not 
rige for the re-paired paradigms? 
m. im of re-paired paradigms is 
пеев is tto characterize. In the first 
ordine hey will be as numerous as the 
ird. paradigms because for any 
н у paradigm a re-paired para- 
iz "eu be obtained by simply carry- 
сн Ly the re-pairing operation. Re- 
bus s paradigms involving completely 
Baa ransfer-task stimuli and/or re- 
арцы Roul not, of course, be dis- 
Ой, hable from their corresponding 
for ED paradigms. | This means that 
cut E: indistinguishable paradigms 
i md of transfer surfaces de- 
diem to handle the re-paired para- 
se s mt coincide with the system 
loci E aces shown in Figure 1. The 
is Mesa up would be along the 
Eom A-B, C-B to A-B, C-D and 
-B, C-D to A-B, A-D. 

a ni ad characteristic of such a sys- 
гераі уре paradigms is that the 
heran, F and B surfaces must be 
ал, for all positions where first- 
"— р and backward associations 
Mi icited in the transfer task. The R 
Vas ae however, would be the same m 
the re-paired and ordinary cases. 
А as as the present writer knows, the 
nly data pertaining to re-paired trans- 
er situations is that involving the A-B, 
e paradigm : There have been no 
ies in which nonzero values of 

s and Xy have been utilized. 
As a final remark on the A-B 
Paradigm, it is of interest to no 


, A-Br 
te that 
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compared to the A-B, A-D the A-B, 
A-Br involves (besides its extra nega- 
tive F effect) the net R-B effect, the 
opposing positive and negative effects 
of transferred response availability, and 
backward associations. This opens up 
the possibility of expecting /ess nega- 
tive transfer using the A-B, A-Br para- 
digm than the A-B, A-D for experi- 
mental specifications generating à 
sufficiently positive net R-B effect. 
Such a situation would arise whenever 
either very-low-M response materials 
were used and/or very few trials of L, 
were given, thus making response 
learning the predominant first-task 
process. In fact, using trigrams as 
responses with L, taken to a 3/6 cri- 
terion (approximately 3.7 trials), Jung 
(1962) reports transfer in the A-B, 
A-Br paradigm to be less negative than 
that in the A-B, A-D; that is, he re- 
ports a positive R-B net effect. Ata 
higher degree of first-task learning (a 
6/6 + 5 criterion), the crossover has 
occurred, and the A-B, A-Br paradigm 
is more negative than the A-B, A-D. 
The corresponding role of first-task 
response M was demonstrated by 
Merikle and Battig (1963) : When first- 
task responses were low M (CCCs), 
approximately zero transfer was ob- 
served in the A-B, A-D paradigm but 
considerable positive transfer in the 
A-B, A-Br; however, as M increased, 
transfer in the A-B, A-Br paradigm 
became even more negative than the 
increasingly negative A-B, A-D. In 


general, studies using adjectives as Te- 


sponses report more negative transfer 
in the A-B, A-Br than in the A-B, A-D 
paradigm (Besch & Reynolds, 1958; 
Besch, Thompson, & Wetzel, 1962; 
Kausler & Kanoti, 1963; Keppel & 
Underwood, 1962; Porter & Duncan, 
1953; Postman, 1962b ; Twedt & Un- 
derwood, 1959), whereas those using 
trigrams report less (Jung. 1962; 


Mandler & Heinemann, 1 56). Thus 
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le 
t R-B effect plays the same ro 
m A-B, A-Br paradigm as in the 
A-B, C-B paradigm. 
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SOME STIMULUS DIMENSIONS 
OF ROTATING SPIRALS 
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The perceptible motion of rotating spiral lines can be analyzed into 3 


components: motion normal to the lin 


motion. 


e, rotational motion, and radial 


General equations for these 3 components have been derived. 


Specific formulas for finding the 3 components for Archimedes, loga- 
rithmic, and hyperbolic spirals have been given in terms of distance 


from the center, speed of rotation, 


these spirals. 
have, respectively, the properties 
motions are constant for all dis 
than a minimum distance. 


Several new methods for measuring 
aftereffect of seen motion have re- 
cently been developed (Carlson, 1962; 
Scott, Bragg, & Smarr, 1963; Scott, 
Jordan, & Powell, 1963 ; Scott & Med- 
lin, 1962; Scott & Powell, 1963; Singer, 
1959; Taylor, 1963), and recent 
physiological findings (Hubel, 1963; 
Maturana, 1963) have given promise 
of an imminent increase in the under- 
standing of these effects, Eliciting 
motions of three Principal types have 
been used: linear, rotary, and spiral. 
The first two types of motion are 
fairly simple, and little difficulty has 
“been encountered in the quantification 
of the stimulus dimensions. However, 
in the case of the spiral, whose motion 
is somewhat more complex, no de- 
tailed treatment of the stimulus 
dimensions appears to have been 
worked out. Any precise treatment 
of aftereffect produced by rotating 
spirals depends crucially upon the 
precision of knowledge about the 
stimulus dimensions. ‘The present 
discussion attempts to provide an 
analysis of those stimulus dimensions 
of rotating spirals which are of 
greatest potential interest to the in. 
vestigator of visual motion aftereffect. 
The first puzzle which must be 
clarified is the apparent mystery asso- 


and the constants associated with 
In addition, 3 “special” spirals have been derived which 
that the normal, radial, and rotational 
tances from the center of rotation greater 
Possible applications have been suggested. 


ciated with the so-called "spiral 
effect" or “spiral illusion” (see "X 
1959). Some students have found 
hard to understand how a e 
spiral can appear to expand or CO 
tract when it is "really" just пн 
It is obvious that а tightly haer 
spiral with many turns will appear = 
have less radial motion than a pr 
open one with fewer turns rotated е 
the same speed. Consider, cd 
a spiral with zero turns. Such " 
“spiral” is really just а “epoken 
Rotation of such a stimulus will ш r 
only rotational aftereffect. Consi ^ 
now a spiral with, say, a single tur: 
This figure will look like a "curve 
Spoke." It can be easily db. 
that this figure will elicit two pacti 
aftereffect at the same time: ГО E 
tional and radial. Somehow, = e 
the number of turns of the spiral 3 
comes less, the perceived motion i a 
its aftereffect undergo a ge etd 
change from almost purely radia lus 
entirely rotational, The am a 
object, of course, has only a up 
Motion. How, then, can the m 
formation from a rotary motion m 
composite of rotary and radial тойо 
take place? 

To answer this question, it is helpful 
to consider the conditions necessa 


344 


е n — 


STIMULUS DIMENSIONS 345 


for the perception of motion per se. 
No homogeneous contour on a homo- 
geneous field can be seen to move if 
the successive positions of the motion 
do not change the visual pattern re- 
ceived by the eye. For example, a 
circle rotated about its center, or a 
straight line moved along its axis, will 
be self-congruent (see Platt, 1958) 
for all positions for the types of 
motion involved, and no motion will 
be perceived in such cases. These 
considerations suggest the axiom that 
the only perceptible motion of a contour 
ts that component of its motion which is 
normal to the contour. If the contour 
is curved, this component will be 
normal to the curve at the point 
Whose perceptible motion is to be 
evaluated. When seen in this light, 
the apparent expansion or contraction 
of a rotating spiral is not an illusion 
at all but is the direct consequence of 
the perceptually effective motion of 
the spiral lines. The “latency of the 
spiral effect" as studied by Stern 
(1959) is, then, no more than the 
latency for motion perception under 
the particular conditions used. Fur- 
ther analysis can now be undertaken. 
The following questions need to be 
i What is the exact speed of 
E perceptible motion of a given 
EC spiral at a given distance 
m the center of rotation? (“Рег- 
ceptible motion” here is used in the 
Sense of the axiom stated above.) 
hat is the direction of this motion? 
low much of the perceptible motion 
Can be represented as radial and how 
much rotational? What types of 
Spirals will produce what functional 
relationships between radial motion, 
rotational motion, and distance from 
© center? 
it In order to answer these questions 
15 necessary to state in mathematical 
orm what is meant by а spiral. A 
Seometric analysis can then be under- 


taken in order to discover the con- 
sequences of the perceptible-motion 
axiom as it applies to particular types 
of spirals. 

For the purposes of this discussion, 
a spiral is defined as any monotonic 
function of the form 


р = f0), 


where p and 0 are polar coordinates. 
There are thus as many different 
kinds of spirals as there are monotonic 
functions of 0. Consider the spiral 
OPX (see Figure 1) with P any point 
on the spiral. APB is tangent to the 
spiral at the point P. The objective 
rotational velocity is represented by 
the vector PZ, perpendicular to the 
radius vector p. If rotational velocity 
is represented by c revolutions per 
unit of time, 


PZ = 2тор. 


Let PN be perpendicular to APB. 
The vector PQ is the projection of 
PZ on PN. It is proposed that the 
vector PQ represents the velocity of 
the perceptible component of the 
motion of the spiral line at the point P. 
Let ZAPO = ұ. Since ZNPO = 90° . 
+y and ZZPO = 90°, LZPQ = № 


and the length of vector PQ is there- 
fore given by 


PQ = 2rwp cos V [11 


Equation 1 represents the total per- 
ceptible motion which will be called 
normal motion since its direction 15 
normal to the curve. It can, of course, 
be resolved into a radial component 
and a rotational component. These 
are shown by the vectors PE and | PK, 
The length of PE, or 


respectively. gth of | 
the speed of radial motion, is given by 
PE = РО sin v, 


\ 


2тор cos V sin V. [2] 
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Fic. 1. Geometric 


[9] 


representation of the 


Perceptually effective components of the 


motion of a rotating spiral. 
to the spiral OPX, with P a 
spiral. PN is normal to t 


point P. PZ represents the 


velocity. PQ 


motion. PK and PE 
on PZ and PM, res 


rotational component, 
component.) 


For PK, or the rot; 


(APB is tangent 
Ny point on the 
he spiral at the 
physical rotational 


is the Projection of PZ on PN. 
PQ represents the norm; 


al component of 
are projections of PO 
Pectively. РК is the 
and PE is the radial 


ational motion, the 


formula is 
PK = PQ COS y, 
= 2пор cos? y, [3] 


The next task i 


formula for the angle y. 


be verified (see, e.g 


Longley, 1941) that th 


obtained from 


5 to find a usable 
It can easily 
-; Granville, Smith, 


е angle y can 


respect to 0. Where , ; 


function of 6, the fo 


tan (180° — y) = 


tan y — 


Obviously, then 
must take into ac 


p/p’, 
= pho. [4b] 


» а formula for y 
count the type of 


spiral under consideration. E 
kinds of spirals have been used id re : 
ommended for use in researc JA. 
aftereffect of motion : the nent i 
the logarithmic, and the hyper em 
In the case of the Archimedes spiri 
the formula is 
p= аб, 

А т " ive 
where a is a constant. The derivativ 
with respect to pis 

p= a. 
From Equation 4a 


tan y = аб/а = 0 = р/а, 

V = arctan p/a. [5] 
For the logarithmic or “equiangular’ 
spiral the formula is 

peg 

where b is a constant. The derivative 
p! = be, 
Again from Equation 4a 

V = arctan 1/b. [6] 


ic Or 
The formula for the hyperbolic 0 
reciprocal spiral is 


р = c/0, 


à Я, tive 
Where c is a constant. The derivat 
is 


P = — с/в, 


: r " ; ely 
Since in this case p and 0 are invers 


d, 
related, Equation 4b must be use 
and 


V — arctan c/p. [7] 


Equations 1, 2, and 3 can now s 
Tewritten for each of the three ag 
ОЁ spirals considered by ag pt 
the appropriate expressions for 
angle y, ; 

For example, for the Archimedes 
Spiral Equation 5 is substituted, a 


ee 
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Equation 1 becomes 


PQ 


2тор cos arctan ру d, 
2атор 
Veta 


Equation 2 becomes 


PP... t 
PE = 2rop cos arctan p/a 
sin arctan p/a, 
_ 2атор? 
pra 
Equation 3 becomes 
pL 
PK = 2тор cos? arctan р/а, 
_ 2а?тор 
а" 
" Similar substitutions of Equations 
um 7 yield corresponding expres- 
ns for the logarithmic and hyper- 


o 
а spirals. These formulas аге 
mmarized in Table 1. 


N о A^ 


PERCEIVED MOTION (IN. PER SEC.) 


5 1.5 
DISTANCE FROM CENTER 


normal, radial, and rot 


Fic. 2, Graphic representation of the 


Arc] 
ре chimedes spiral as a function o 
Г second, ) 
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TABLE 1 


SuMMARY OF FORMULAS FOR THE INDICATED 
COMPONENTS OF PERCEIVED MOTION 
FOR THREE KINDS OF SPIRALS 
AS A FUNCTION OF DISTANCE 
FROM THE CENTER (p) AND 
SPEED OF ROTATION (w) 


Г distance from the center. 


" Normal Radial |Rotational 
Spiral formula | motion motion motion 
(PQ) (PE) (PK) 
Archimedes: | 2azop Qarwp? | 2a*zep 
р = аб Ма | ete eta 
Logarithmic:| _267#e_ | 2bzep | 2b?rwp 
pet Уй -„1|Ё+1 | &+1 
EET " 
Hyperbolic: | <2“? — 2стор? 2p? 
00 = с у | ete | + 


Figure 2 represents graphically the 
normal, radial, and rotational motions 
as functions of distance from the 
center of an Archimedes spiral. It 
can be seen by inspection of the 


2 2.5 3 
(INCHES) 


ational pct for the 
= 1 revolution 


(a = .04, 
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formula and the curves in Figure 2 
that the normal motion increases with 
negative acceleration and approaches 
2arw asymptotically. Тһе radial 
motion, although somewhat less than 
the normal motion, approaches the 
same asymptote. Rotational motion 
reaches a maximum and thereafter 
approaches zero. To find the maxi- 
mum, the original formula, 


EL 2 
PK = 2а?тор 


P+ a? 
can be differentiated with respect to p: 
dPK 
"dp 7 Trola + TE] 


+ 2a*zo[a* + Py. 
Simplifying and setting equal to zero: 
2а?т®[а? + E= 4а?тор? 
Lo + py =o 


at this function 
P (and therefore 


2а?т® з 0 
Therefore 
d —pg-g 
pex. 


Interestingly, then, 
rotational motion fo 
spiral is obtained ex 
the center of rotat 
this value of р 
formula: 


the maximum 
r the Archimedes 
‘actly a units from 
ion. Substituting 
into the original 


| 


2а?тва 
а а = mod. 


Kinax = 


У 


The maximum rot 


ational motion is 
thus exactly half th 


e limiting normal- 


and radial-motion rates. The Archi- 
medes spiral, then, in addition to being 
easy to draw, would be the stimulus 
of choice in experiments where rela- 
tively uniform radial motion is desir- 
able, and where little rotational 
motion is wanted—provided that the 
experimenter studies motion sub- 
stantially more than a units from the 
center of rotation. ` 
The logarithmic spiral has different 
but equally interesting properties. 
nspection of Figure 3 and the second 
line of Table 1 shows that all three 
types of motion increase without 
limit as linear functions of distance 
from the center of rotation, The 
logarithmic spiral has been a favorite 
geometric figure among mathemati- 
cians for many years because of its 
numerous interesting properties (see 
Turnbull, 1956; Weyl, 1956), and it 
15 not surprising that it also has 
interesting perceptual properties when 
rotated. A rotating logarithmic spiral 
filling the visual field would be 
visually similar to an approaching Of 
Teceding object with parallax and 
accommodation artificially removed 
from the Situation. As a laboratory 
Stimulus it would also have the ad- 
Vantage that "approach" or ''reces- 
Sion" could be continued for indefinite 
lengths of time without either running 
mto the stimulus or having the 
Stimulus-obser sor distance exceed rea- 
sonable values. It should be possible 
to demonstrate 4 very curious рег- 
ceptual ambiguity with this kind of 
Spiral. Under appropriately reduced 
cues, an observer should be unable to 
tell whether the logarithmic spiral is 
changing its distance from him Or 
whether it jg simply rotating! From 
the expressions in Table 1, it can easily 
© Seen that the ratio of radial to 
rotational Motion of a logarithmic 
Spiral is everywhere 1/6. The value 
of b used in Constructing Figure 3 was 


— we Sh 
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(INCHES PER SEC.) 


N 


PERCEIVED MOTION 


n А ————] 


| 2 3 
DISTANCE FROM CENTER (INCHES) 


Fic. 3. Graphic representation of the normal, radial, and rotational components 
for a logarithmic spiral. (b = .5, w = 1 revolution per second.) 


chosen to produce twice as much 

radial as rotational motion. The 30 
Tatio of radial to rotational motion 
Can, of course, be given any desired 
Value by proper selection of the value 
of b. 

With both the Archimedes and the 
logarithmic spirals, it is possible to 
increase the number of turns by in- 
creasing the size of the spiral. This 
18 impossible with the hyperbolic 
Spiral, however, because the spiral 
Approaches a straight line c units from 
the origin and parallel to the reference 
vector. For the hyperbolic spiral, 
both the normal motion and the rota- 


n 
e 


m 
о 


PERCEIVED MOTION (INCHES PER SEC) 
5 f: ] 


E 


Попа] motion increase without limit > i = à 5 
аз Positively accelerated functions of DISTANCE FROM CENTER (INCHES) 
‘stance from the center of rotation Fic. 4. Graphic representation of the 


See Figure 4). The radial motion normal, rotational, and radial components for 
Оесотеѕ asymptotic to 2 сто thus а hyperbolic spiral. (с = 5, w = 1 revolution 


Sceming to mimic the Archimedes. рег second.) 
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However, at or beyond a value of D 
for which the radial component is 
close to its aymptote, the rotational 
component has become so large that 
radial motion is only a negligible part 
of the total normal motion. The 
hyperbolic spiral would thus seem to 
be a poor choice for most studies of 
visually perceived motion. 

All previous studies of spiral after- 
effect have used one of the three types 
of spirals discussed above. For spe- 
cific experimental purposes, however, 
it may prove advantageous to con- 
Struct special spirals which have 


s for 
In par- 
О design 
mponents 
distances 
from the 
ill now be 
will be to 


P and p' into Equa- 
tions 1, 2, and 3. The resulting 

then be treated as 
differential equations with the vectors 


PQ, PE, and PK treated as constants, 
The first 5 


pecial spiral might be 
called the "isonormal" Spiral. This is 


a spiral which, when rotated, has the 
Same normal motion at all distances 
from the center greater than a mini- 
mum distance. This minimum dis- 
tance would be where the desired 
normal motion equals the objective 
motion. In symbolic terms, it would 


be where PZ = РО. For distances 
less than this, 


the isonormal spiral is 
undefined. From Equation 4a it 
follows that 


u 


p 
cos y — Vg [8] 
and 
sin y = —2—. [9] 
Vip! + р? 


To derive a differential equation for 
the isonormal spiral, Equation 1 can 
be solved for cos y: 


PQ 
cos y — suy" 
Substituting from Equation 8, 
p PQ 


Vp + р? 2тор` 
Squaring both sides, rearranging 
ER ap 
into standard form, and writing 16 


for p’ results in 
472022 [s] ы PQ? 


s mer = 0, [10] 


which is the desired differential equa- 


tion. The solution can be expressed 
as follows :! 


@= [=f T" 
1g 
— arctan [=] — 1. 
PO 


Figure 5 shows 
the isonormal 
values of p the s 
an Archimedes 


the first two turns of 
spiral. For larger 
piral is approximately 
Spiral whose formula is 


=| BS т 
udi Ea LEHIE 
The second Special spiral has the 


Property that the radial motion is 
constant for all distances from the 


! Specifications for isonormal, isorotational, 
and isoradia] spirals have been deposited 
with the American Documentation Institute. 
Order Document No. 8333 from ADI Auxiliary 
Publications Project, Photoduplication Serv- 
ice, Library of Congress, Washington, pz e 
20540. Remit in advance $1.75 for mera- 
film or $2.50 for Photocopies and make checks 
Payable to: Chi 


i ef, Photoduplication Service, 
Library of Congress, 


бе 
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Fic. 5. First two turns of the isonormal 


spiral. (Rotation of this figure produces the 
Same total perceptible — motion—motion 
normal to the curve—at all distances from 


the center. The center is indicated by the 
dot.) 


center greater than a minimum. The 
minimum for the isoradial spiral is the 
Point where the objective motion is 
twice the desired radial component. 
In symbols, PZ = 2 PE. As proof of 
the latter, it can be readily seen (see 
Figure 1) that the maximum radial 
component for any given value of PZ 
and pwill be wherey = 45°. Triangle 
ZPQ will be an isosceles right triangle, 
K will bisect PZ, and PK will equal 
QK. Since OK = PE, PZ = 2 PE. 
To find the differential equation, 


Substitute from 8 and 9 into 2 as 
follows: 


2тор " -— — 
Vp? + р? Vp? + р? 


which, in standard form, becomes: 


2mop? dp = PE 
|g РЕр 


РЕ 


The solution is: 
EU e ЕЗІ 1 
PE PE 
— arctan EN — i} 
PE 
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Figure 6 shows the first two turns of 
the isoradial spiral (using positive 
values of the bracketed function). 
For larger values of p this equation, 
too, is approximately that of an 
Archimedes spiral. This time the 
approximation formula is: 


PE T 


The third and last special spiral is 
the isorotational spiral which has the 
same speed of rotational motion at all 
distances from the center, greater than 
a minimum. Аз in the isonormal 
spiral, the minimum is the point 
where the desired component equals 
the objective motion, or in this case, 
where PZ — PK. For values of p less 
than this, the isorotational spiral is 


undefined. Solving Equation 3 for 
cos? y, 
Е РК 
cos?’ y = а 
2rwp 


Substituting from Equation 8, 


po PK 
pF p? 2rop' 


Fic. 6. First two turns of the isoradial 
spiral. (Rotation of this figure produces the 
same perceptible speed of radial motion at all 
points on the curve. This figure is to be con- 
trasted with Figure 5. The final value of p 
is the same for both figures.) 
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5 


and rearranging into standard form 
yields 


dp]? == к 
Е = PE 


dp | 
S e| e 12 
= БЕ [ Ф| 0, [12] 


which is the differential equation for 
the isorotational spiral. The solution 


is quite similar to that for Equation 
105 


EL 


= 2 arctan 4/ i — 1. 
PK 


Figure 7 shows the first two turns of 
the isorotational spiral. For larger 
values of p, the Spiral has approxi- 
mately the form 
РО 
= РО [o + т]. 


8то 


Analysis of the sti 
of rotating spirals, then, turns out to 
be somewhat more involved than one 
might at first expect. The student of 
Perception might be tempted to by- 
pass these complexities by employing 
simpler stimuli, Several investigators 
have chosen simpler stimuli on the 
grounds that the spiral introduced 


mulus dimensions 


Fic. 7. First two turns of the isorotational 
spiral. (When this figure is rotated, the 
perceptible rotational component is the same 
at all distances from the center. The final 
value of p is equal to that of Figures 5 and 6.) 


unnecessary complexities into the ex- 
perimental situation. Prior to the 
present analysis, such arguments were, 
of course, quite cogent due to the 
unknown character of these ,com- 
plexities. Now that an analysis has 
been made, however, there are some 
rather compelling advantages to the 
use of rotating spirals. Motion after- 
effect was first studied with linear 
motion, as in the familiar waterfall 
illusion. Most theories of aftereffect 
at that early date had to do with eye 
movements and eye-muscle innerva- 
tions. Plateau (1850) disproved the 
universality of such theories precisely 
by introducing the rotating spiral as 
a stimulus. The rotating spiral with 
a central fixation point is a superior 
stimulus in any experiment where eye 
movements might be an important 
control variable. The method of 
measuring rate-of-motion aftereffect 
introduced by Cords and Brucke 
(1907) and extended by Singer (1959); 


Scott, Bragg, and Smarr (1963) ; 


Scott, Jordan, and Powell (1963) ; 
Taylor (1963); and perhaps others, in 
which a test stimulus moves slowly in 
the same direction as the inspection 
stimulus so as to cancel the aftereffect, 
Presents peculiar difficulties in water- 
fall-illusion experiments. In such 
experiments а pattern of moving 
Stripes typically is seen during the 
inspection Period inside some kind of 
frame, usually rectangular. The test 
stimulus, then, is seen inside the same 
Ог a similar frame. The real motion 
of such a test stimulus can be per- 
ceived by Comparison of point cor- 
Tespondence from the stimulus to the 
frame and by distance observations 
made relative to the frame. A үн! 
sible way of avoiding these undesired 
Cues is the elimination of the frame. 
Here the inspection stimulus and the 
test stimulus are seen against a field 
of total darkness. The difficulty with 
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this solution is that during the test 
period the well-known phenomenon 
of autokinetic movement may in- 
troduce unknown and uncontrolled 
amounts of error which have in addi- 
tion the disadvantage of being highly 
Susceptible to suggestion and pre- 
conception on the part of the subject. 
The use of a rotating stimulus having 
radial lines drawn on it, for example 
in Wohlgemuth's (1911) studies, has 
many of the advantages associated 
with the spiral. Indeed, the present 
analysis shows that such stimuli are 
merely special cases of the spiral. 
More complex rotating patterns, as in 

aylor's "swirling lines," have the 
advantages of rotating stimuli in 
general, but do not lend themselves 
to the study of more complex problems 
such as those associated with differ- 
ential aftereffects toward and away 
from the fovea. 

Different investigators have used 
Spirals differing in type, pitch, number 
of "throws," proportion of black and 
White areas, diameter, and speed of 
rotation. Viewing distances have 
been different between experiments. 
Aftereffect while viewing the same 
pattern stationary has been timed. 
Aftereffect while viewing some other 
Pattern has been timed. In Scott's 
experiments, only the radial com- 
Ponent of aftereffect was measured, 
even though rotational aftereffect was 
Certainly present. In Taylor's ex- 
Periments only rotational aftereffect 
Was measured ; even though the possi- 
bility that his swirling lines may elicit 
other directions of aftereffect remains 
untested. Many of these experiments 
done by different investigators in 
different parts of the world are 
directed at fundamental questions re- 
Sarding motion perception. When 
Conflicts appear in the data of these 
diverse studies, some exact method of 


quantifying the stimulus conditions is 


353 


needed. For example, in the studies 
of Price and Deabler (1955), a 920? 
Archimedes spiral 6 inches in diameter 
rotating at 78 and 100 revolutions per 
minute (rpm) was used. Viewing 
distance was 8 feet. A study by 
Scott, Bragg, and Smarr (1963) which 
differed substantially in result from 
that of Price and Deabler involved a 
1440? Archimedes spiral 8 inches in 
diameter rotated at 160 rpm and 
viewed at a distance of 6 feet. In 
Price and Deabler's study, the spiral 
itself served as the test stimulus. The 
aftereffect, then, was presumably 
aftereffect resulting from the "normal 
motion" of their spiral. In the study 
by Scott, Bragg, and Smarr, only the 
radial aftereffect was measured. This 
aftereffect presumably resulted from 
the radial component of the particular 
spiral involved. 

In comparing these two studies, a 
precise understanding of the difference 
between the two stimulus conditions 
is essential before intelligent hy- 
potheses can be made regarding the 
difference in outcome. Using the 
method derived here, the normal 
motion at the edge of the spiral in 
Price and Deabler's study was 70 
minarcs per second for the 100-rpm 
speed and 54 minarcs per second for 
the 78-rpm speed. In Scott, Bragg, 
and Smarr’s experiment, the maxi- 
mum normal motion at the edge of the 
spiral was 126 minarcs per second. 
The radial component of this motion 
was also 126 minarcs per second to the 
nearest minute of arc. In this latter 
study, aftereffect was measured at a 
distance of 1.5 inches from the center 
of the spiral. Even at this distance, 
the radial and normal components 
were still 126 minarcs per second to 
the nearest minute of arc. These 
results show that the speed of the 
eliciting stimulus in the Scott, Bragg, 
and Smarr experiment was from 80% 
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to 133% greater than that used by 
Price and Deabler. : 
A recent study by Powell (1962) in- 
vestigated the differential rate of 
radial aftereffect measured at different 
distances from the fovea. An Archi- 
medes spiral was used in this study. 
An initial assumption was made that 
since the formula for an Archimedes 
spiral is p = a@ and since the rate of 
change of p with respect to 0 is the 
constant a, that different regions of 
the retina were being stimulated with 
the same speed of motion. The 
present analysis reveals that this was 
an erroneous assumption.  Fortu- 
nately in this case, the combination of 
spiral and range of distances from the 
fovea chosen by Powell were such 
that the erroneous assumption was 
actually a close approximation to the 
result which would be obtained by the 
more exact method developed here, 
This example, however, reveals the 
necessity of such an analysis for the 
more detailed study of the dynamics 
of motion-aftereffect phenomena, 

If the theoretical model presented 
here has validity, it should be possible 
to show similarities between results 
obtained by different investigators 
using similar eliciting speeds of mo- 
tion. Similar results might be ex. 
pected in comparing linear-motion 
aftereffect to spiral aftereffect. 


terefi p Data 
exist in the literature which make 
possible comparisons of this kind. 


From the available data, three st 
have been selected for compa; 
The first of these was done by Granit 
in 1928. In Granit's Study, the 
eliciting stimulus was a rotating drum 
having parallel black-on-white bands 
drawn on it. The drum was seen 
through a square window, and the 
objective speed of the bands was 10 
centimeters per second. Granit meas- 
ured the duration of the aftereffect 
resulting from this virtually linear 


udies 
rison, 
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motion at a number of different view- 
ing distances. He was ostensibly 
studying the effects of retinal area on 
duration of the aftereffect. Although 
in his preliminary discussion of the 
problem he was obviously aware that 
the speed of motion of the stimulus 
pattern across the retina also varied 
with viewing distance, he seemed to 
ignore this in his subsequent discus- 
sion of results. For present purposes, 
Granit's results will be treated as if 
speed of the eliciting motion were the 
only variable, but it should not be 
forgotten that size of retinal area was 
also varied in his experiment, Figure 
8 shows the average duration of after- 
effect obtained by Granit asa function 
of speed of eliciting motion (solid 
circles, solid line). The speeds of 
stimulation are expressed in minarcs 
per second based on a simple trigo- 
nometric transformation of the linear 
motion of the bands on Granit's drum 
into units of visual angle. As can be 
seen, the aftereffect increased from 
stimulating speeds of about 72 min- 
ares per second, reaching a maximum 
at approximately 120 minarcs per 
Second and thereafter decreasing. 
Granit also varied the brightness of 
the stimulus and the degree of dark 
adaptation, but the results from all 
these different conditions have been 
Dooled as have the results from the 
three subjects he used in his experi- 
ment. It should be noted that the 
Pooling of all these variables might be 
expected to have some adverse effects 
on any attempt to compare his results 
with others, This fact will make any 
similarity between his results and 
other results all the more striking. 

_ Scott (1962) carried out a norma- 
tive study in which eight different 
Archimedes spirals were used. These 
spirals varied in pitch and number of 
throws. The measure of aftereffect 
used was the rate of expansion Or 
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500 
(MINARCS PER SEC.) 


300 400 


of the aftereffect obtained by Granit, Scott, and 


iments plotted on a common abscissa : 
ng formulas developed here. (Freud’s 
duration of the aftereffect. Scott’s data, 
artificially adjusted so that the highest 


Point is graphed at the highest point of Granit’s data.) 


contraction of a test stimulus which 
subjectively cancelled the aftereffect. 
bi would seem to be little hope 
os a similarity might be found be- 
oe the results of Scott's study and 
s se of Granit s. The two experi- 
nents differed in several respects. 
d the eliciting motions were quite 
a erent. Second, in Granit's study 
he speed of stimulation was con- 
trolled by changing the viewing 
distance; in Scott's study the speed 
Was controlled by varying the speed 
of rotation of the spirals and the pitch 
of the spirals. Several other variables 
Were involved in both Granit's and 
COtt's studies. The measure of after- 
effect used was different. In spite of 
these many differences, the curve in 
Figure 8 which summarizes Scott's 
Tesults (open circles, dashed lines) 


bears a close similarity to the results 
of Granit. lt must be pointed out, 
however, that since the two measures 
of aftereffect were not directly com- 
parable, the highest point of Scott's 
data has been artificially adjusted to 
match the highest point in Granit's 
data. 

The third study is a recent one by 
Freud (1962). Freud was also in- 
terested in the effect of varying the 
size of the stimulated retinal area, 
and, like Granit, he varied the area by 
manipulating the viewing distance. 
He used an Archimedes spiral and a 
constant speed of rotation. He thus 
committed the same error as did 
Granit in 1928: ignoring the fact that 
changing the distance also changes 
the speed of stimulation. Freud's 
measure of aftereffect was duration. 
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Computation of his speeds of eliciting 
motion and pooling his data across 
all other conditions results in the data 
shown in Figure 8 (Xs, solid line). 
Unfortunately, the range of speeds 
used by Freud was very narrow, but 
within that range the results have the 
same sense as those obtained by 
Granit and Scott. That is, the after- 
effect increases for stimulating speeds 
between 30 and 132 minarcs per 
second. Other variables in Freud’s 
study were the location of the stimu- 
lated area near or far from the fovea 
and whether aftereffect was observed 
monocularly or interocularly. The 
presence of these other variables fails 
to obliterate the similarity of results, 
Thus far, the comparison of various 
results suggests that motion after- 
effect is surprisingly predictable from 
a knowledge of the speed of the elicit- 
ing motion even though other vari- 
ables are known to influence it, 

The availabilit 
isoradial, and 


suggests the possibility of some new 


suggested by 

presented here? 
and rotational 
ent? Most stu 
effect have us 
speeds of stimu 
of the retina ( 
studies and the Archin 
studies). Calculation of stimulating 
speeds for the logarithmic spirals used 
in two recent experiments (Pickersgill 
& Jeeves ;? Singer, 1959) suggest that 
under these nonuniform stimulus con- 
ditions unusually long durations of 
aftereffect are encountered. No doubt 
a number of new experiments will be 


nedes-spiral 


? Mary J. Pickersgill and M. 


A. Jeeves, 
personal communication, 1964, 


devised to test various aspects of the 
“perceptible-motion axiom” and its 
consequences. т 

The equations developed in this 
article, then, have two immediate 
applications. The first is to clarify 
comparisons between the findings of 
different investigators; the second is 
in the design of new experiments. 
Investigators armed with the results 
of the present analysis should be able 
not only to resolve seemingly contra- 
dictory outcomes from different lab- 
oratories but also to choose stimulus 
materials for specific experimental 
purposes in a way much more precise 
than was previously possible. 
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RATING EXTREMITY: 


PATHOLOGY OR MEANINGFULNESS? 


DENIS O'DONOVAN 1 
Florida Atlantic University 


An attempt is made to reconcile those 


to use the extremes of ratin; 
those studies linking polariz: 
gested that simultaneous consi 
fulness and subject personalit: 
seeming discrepancies. A set 


This article attempts a reconciliation 
of two contrasting sets of hypotheses 
about the tendency to use the extremes 
of rating scales. This tendency is reli- 
able (Peabody, 1962; Rundquist, 1950; 
Zuckerman, Norton, & Sprague, 1958) 
and easy to measure, and data have 
been published in support of each set 
of hypotheses, 

The first set links the use of ex- 
treme ratings with pathology. With 
varying amounts of empirical support, 
extreme rating style has been suggested 
as à measure or correlate of neurosis 
(Johnson, 1946), primitive id impulses 
(Brenner, 1955), mental-patient status 

Borgatta & Glass, 1961), deviance 
(Berg, 1957), maladjustment (Berg 
& Collier, 1953; Zax, Gardiner, & 
Lowy, 1964), anxiety (Lewis & Tay- 
lor, 1955), intolerance of ambiguity 
(Frenkel-Brunswik, 1949: Soueif, 
1958), inflexibility (Schutz & Foster, 
1963), desire for Certainty (Brim & 
Hoff, 1957), ethnocentrism, authori- 


tarianism, and rigidity ( Adorno, Fren- 
kel-Brunswik, 


Levinson, & Sanford, 
1950; Mogar, 1960), and dogmatism 
(Rokeach, 1960). For instance, 


Wertheimer and McKinney (1952) 
completed while 
sity of Missouri 
Florida. 


1 Parts of this paper were 
the author was at the Univer 
and at the University of 


studies linking the tendency 


g scales (polarization) to pathology with 
‘ation to more effective behavior, It is sug- 
deration of stimulus and task meaning- 
y classification will shed light on the 


of propositions consistent with the results 
of both sets of studies is advanced. 


of verbal learning for improved reseca 


Implications of laboratory studies 
rch in response style are discussed. 


report, ". . . neurotics exceed the con- 
trols in overreaction in using more 
often the extremes on a rating device 
[p. 59]." 

In contrast, the second set of hy- 
potheses links the use of extreme rat- 
ings with meaningful commitment ama 
constructive behavior. Extremeness A 
ratings has been used or suggeste 
as an operational definition of vj 
cisiveness (Cromwell & Caldwe » 
1962), saturation of meaning (Mitsos, 
1961), committing oneself versus eva 
siveness (Broen & Wirt, 1958), saen” 
lus importance (Blum, 1964), ie 
sity of meaning (Kanungo & Lambert, 
1964), involvement (Saper, 1965, 
emotional investment (Morris, Eidu 
son, & O'Donovan, 1960), edunt, 
ment (O'Donovan, 1960, 1962), же 
meaningfulness (Isaacson, 1962). oe 
Studies have been stimulated by "m 
earlier writings of Morris (1938, 1940, 
1956) and Kelly (1955, 1958). - 

* most emotionally neutral ein 
used for extremity of rating is S ew 
of polarization (Osgood, Suci, & € 
nenbaum, 1957 ), defined operationa i. 
as the distance from the point of u^ 
(neutral point) on a rating scale. e 
Convenience, polarization will be br 
in this article to denote a tendency to 
rate more extremely ; depolarization 
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RATING EXTREMITY 


denote a tendency to rate closer to the 
neutral point. While the rating scales 
used in the studies referred to above 
differ in various details, in each case 
the average deviation from the neutral 
point of the ratings of each subject (or 
group of subjects) was used to test 
some hypothesis. In other words, de- 
gree of polarization was used as an 
Operational definition of some tendency 
which was also given a conceptual 
definition, 
aeris is a fourth and rela- 
at Fel neglected aspect of response 
yle. In his review of response style 


as a personality variable, McGee 
(1962) states: ` ' 


i big has been confined almost ex- 
ii hae ME only three. types of response 
duree Y the social desirability set, charac- 
désitabi by the consistent endorsement of 
able e traits and the denial of undesir- 
ботен D the deviation of a pattern of 
a m S from the typical pattern produced by 
eas, population of responders; and the 
tee set, which consists of tendencies 
ae oose the “true,” "agree," or "like" op- 
alt rather than their respective negative 
ernatives [p. 284]. 


x role of polarization in the gen- 
ы study of response set in personality 
M essment (Berg, 1965; Jackson & 
isi 1962) and its relationships to 
ei desirability (Crowne & Mar- 
ad e, 1960; Edwards, 1957), devi- 
(Cou (Berg, 1961), and acquiescence 
Pura & Keniston, 1960; Martin, 
) are not clear at this time. 

"er is one obvious reason for the 
С glect of polarization. With the ex- 
eption of Berg's (1961) four-choice 
erceptual Reaction Test (PRT), most 
Of the instruments used in response- 
Style studies have been true-false or 
ichotomous forced choice in form. 

nder these limited circumstances, 
leviation, acquiescence, and social de- 
Sirability all seem more appropriate for 
idy than polarization. However, 
he definition of depolarization could be 
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considered to cover the use of a permis- 
sible-evasive response to a true-false 
item or even the omission of any re- 
sponse. The tendency to avoid com- 
mitting oneself is well documented as 
a stable personality characteristic 
(Cronbach, 1950; Guilford, 1954; 
Lorge, 1937). Rubin-Rabson (1954) 
found a negative correlation between 
this tendency and self-sufficiency. Ed- 
wards and Walsh (1964) found this 
tendency to be unrelated to acquies- 
cence and social desirability. The dif- 
ficulty with this use of the concept of 
depolarization is that the instruments 
provide no opportunity to measure the 
full range of the polarization-depolari- 
zation continuum. 

Six studies suggest that polarization 
(in its more usual definition) is readily 
distinguishable from acquiescence, as 
variously defined. 

Couch and Keniston (1960) found 
that response categories agree and dis- 
agree were more highly correlated with 
overall agreement score (acquiescence) 
than were strongly agree and strongly 
disagree. Schutz and Foster (1963) 
found one factor for acquiescence and 
another for polarization. They sug- 
gest that the tendency to choose 
strongly agree probably has a different 
meaning from the tendency to choose 
moderately agree. In a factor analysis 
of responses to neutral questions, 
Broen and Wirt (1958) found two 
nearly orthogonal factors, one of the 
tendency to agree and the other of the 
tendency to disagree, rather than one 
acquiescence factor. Polarization was 
positively loaded on the first factor. 
The tendency to be ündecided was 
negatively loaded on both factors. 

Zuckerman, Oppenheimer, and Ger- 
showitz (1965) found that actors dif- 
fered from teachers in polarization but 
Forehand (1962) 
asures related to 
measures not 


not in acquiescence. 
found polarization me 
each other, acquiescence 
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related to each other, and polarization 
not related to acquiescence. Zucker- 
man et al. (1958) found acquiescence 
positively related to authoritarian 
parental attitudes, and polarization 
negatively related to these attitudes. 

The relationship between polariza- 
tion and deviation appears clear at first 
glance: If the typical pattern is polar- 
ized, polarization is not deviant, other- 
wise it is. However, there are a num- 
ber of complications. Sechrest and 
Jackson (1962, 1963) point out some, 
including the problem of the deviantly 
nondeviant. Johnston (1964) found a 
group of psychologists who followed 
the typical cultural value pattern to a 
greater extent than the typical Ameri- 
can, that is, they rated the accepted 
values higher and the rejected values 
lower than the norm group. Whether 
their polarization defines them as devi- 
ant, nondeviant, or deviantly nondevi- 
ant is not clear. 

When social desirability is added, the 
picture becomes even more clouded. 
For instance, Zax, Cowen, and Peter 
(1963) found that novice nuns devi- 
ated from college females i 
greater polarization on ratin 
desirability. 

While the present discuss: 
marily limited to the one 
style” of polarization, i 
meaningfulness places 
mental agreement 
emphasis on 


n showing 
gs of social 


ion is pri- 
“response 
ts emphasis on 
it in funda- 
with the recent re- 


item content (McGee, 
Ко Peabody, 1964, 1965; Rorer, 
965; 


ponse style be- 
than item con- 
nfusion, 


Denis O’Donovan 


lus and subject variables. For а 
stance, Zuckerman et al. (1965) foun 
greater polarization for actors than а 
teachers on the РКТ. They e 
gested two interpretations; "a genera 
behavioral deviancy which expresse 
itself in an inability to modulate Gah 
attitudinal reactions” or E arc А 
emotionality or drive [p. 170]." n 
hypothesis that the abstract designs М 
the test might be more meaningful t 
the actors was not considered. This is 
consistent with their statement: “The 
PRT is the most standardized of M 
non-content tests [p. 169].” heros 
Berg's (1959) statement of the ШШЕ 
portance of test-item content did not E 
as far as Zuckerman's er ager 
statement, Berg does reveal that i 
PRT was designed to be relatively 
meaningless by Berg, W. A. me 
and E. H. Barnes, all of whom an 
teachers and none of whom are pra 
Meaningfulness is a function of Ke 
subject and the stimulus, Pathology E 
the subject may be expressed in Г 
Sponse to any stimulus. jore 
This article will attempt to exp a 
the conditions under which degree м 
Polarization is more predictable oe 
the stereotypic and perhaps pattiolog 
cal response style of the subject, Ж 
those conditions under which degree = 
polarization is more predictable fro 


degree of meaningfulness of the rating 
task, 


POLARIZATION as PATHOLOGY 


Several of the studies prevail 
mentioned found a positive poscis. 
between polarization and such uc 
Pathologies as authoritarianism 58) 
dogmatism, while Barker ees 
Paine (1964), Peak, Muney, and 958) 
(1960), and Zuckerman et al. a The 
failed to find such a relationship. h in- 
present article deals primarily wit ons 
dividual pathology, that is, the = 
trast between effective behavior 
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neurotic or schizoid behavior. How- 
ever, one statement about the measure- 
ment of social attitudes seems appro- 
priate. When a bigoted person re- 
sponds to stimuli related to the subject 
of his bias, both the pathology hypoth- 
esis and the meaningfulness hypothesis 
pon predict polarization. Any test 
: the two hypotheses would necessarily 
olee unrelated stimuli. An example 
5 the interactions involved is the study 
by Mogar (1960). He found that 
Poste) increased with more con- 
e ersial (here translated meaningful) 
imuli and with more authoritarian 
Subjects, 
А ei i (1961) hypothesized that 
fee of thought disorder called the 
a ency to think in terms of absolute- 
ue dichotomies leads to a higher 
Probability of suicide. Neuringer’s 
| шо definition of this tendency 
i пацан, as defined above. He 
ibo E semantic differential (Osgood 
i 957) with concepts "pregnant 
siete pus meanings" and dimen- 
tive P nid weighted on the evalua- 
ий е Не found that suicidal 
Pa and psychosomatic patients 
oe eded general medical and surgical 
Кеке, (without obvious neurotic dis- 
Seri) in polarization. Suicidal and 
шанс patients did not differ 
е each other, From these results 
concluded аё: “Dichotomous 
ене Thinking seems to be a 
di mon characteristic of emotionally 
Sturbed persons [p. 449]." 
co assumptions explicit or im- 
x Cit in Neuringer's ( 1961) study can 
ed testioned. He explicitly assumed 
T the normal hospitalized subjects 
Te under the least stress and did 
Not follow up on the possibility that 
Stress or disinterest might have re- 
Sulted in constriction of their ratings. 
€ seems to have assumed that the 
€xperimental task was equally mean- 
ingful for each group of subjects, 
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though no subjects were consulted on 
this question. Most important, the 
statement of the conclusion quoted 
above seems to include schizophrenics, 
though no known schizophrenics were 
included in the study. Zax, Loiselle, 
and Karras (1960) report no signifi- 
cant differences in extreme ratings be- 
tween schizophrenics and controls, but 
greater use by schizophrenics of neutral 
ratings. It should be added, however, 
that Zax et al. (1964) found greater 
use of extremes by schizophrenics. 

A more comprehensive reformulation 
of Neuringers (1961) conclusion 
might be that the tendency to respond 
with similar behavior in dissimilar sit- 
uations is a common characteristic of 
disturbed persons. Dichotomous eval- 
uative thinking or polarization, when 
inappropriate, is simply a special case 
of this general tendency shown by 
some disturbed persons, notably those 
with certain cognitive organizations 
often labeled neurotic. 

Long before the current flurry of re- 
sponse-style research, Fenichel (1945) 
said: “Patients, instead of reacting 
vividly to actual stimuli, according to 
their specific nature, react repeatedly 
with rigid patterns [p. 542].” The 
three key words in this interpreta- 
tion are: vividly, suggesting that an 
intense response is sometimes appro- 
priate ; repeatedly, suggesting that 
more than one sample of behavior is 
needed to assess rigidity; and patterns, 
suggesting that no one pattern is the 
rigid pattern. К 

One example of effective organiza- 
tion is the statement of a scientist 
quoted by Eiduson (1962): "If you 
never chase sidelines, you never find 
anything new; if you chase all the side- 
lines, you never find anything because 
you are running down too many blind 
alleys [P- 126]." This suggests to the 
writer that an effective person polar- 
izes toward meaningfulness : already 


362 


existing meaning in the adjusted per- 
son, potential meaning in the creative 
person. He neither dissipates his en- 
ergy in intense Tesponses to every 
trivial stimulus nor misses chances for 
riate polarization. 

ucro the ineffective person's 
behavior is more predictable from his 
customary response style, even when 
the situation calls for quite different 
responses. The general case has been 
called stereotype, rigidity, persevera- 
tion, lack of discrimination, etc. Two 
special cases are polarization and de- 
polarization. 


DEPOLARIZATION As PATHOLOGY 


A large number of studies have been 
conducted on the predictability or 
stereotypy of behavior in monkeys 
(Harlow & Zimmerman, 1959; Mason 
& Green, 1962), chimpanzees (Daven- 
port & Menzel, 1963; Menzel, Daven- 
port, & Rogers, 1963), mental defec- 
tives (Berkson & Davenport, 1962), 
institutionalized infants (Ribble, 1943), 
schizophrenics (Adams, 1960; Armi- 
tage, Brown, & Denny, 1964; Hunter, 
Schooler, & Spohn, 1962: Luchins, 
1959; Painting, 1961; Rodnick & Gar- 
mezy, 1957), and process versus reac- 
tive schizophrenics (Reisman, 1960; 
Zlotowski & akan, 1963). АП of the 


maternal care, 
man subjects, 
infrahuman subjects, 
what is comm 
fect. 


of depolariza- 
found in Bopp 
an, Morris, and Eidu- 


tion in schizophrenics js 


(1955), O'Donov; 
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son (1959), and Morris et al. (1960). 
Evidence of depolarization in less-effec- 
tive guidance counselors and in schizo- 


phrenics is found in  O'Donovan 
(1960). 


PoLARIZATION As MEANINGFULNESS 


Three studies using the same meas- 
ure of polarization as gue e 
(1961) illustrate the second set : 
hypotheses. Mitsos (1961) had sub- 
jects choose the 9 (of 21) semantic- 
differential scales most personally 
meaningful. Polarization was greater 
on these scales than on others. с 
well and Caldwell (1962) foun 
greater polarization on subjects’ own 
dimensions, taken from their Role Con- 
struct Repertory Tests (Kelly, 1955), 
than on dimensions taken from the 
RCRTs of others, Saper (1964) 
found greater polarization on a Sper 
cially designed “involvement” question- 
naire for subjects who discussed a i 
history with each other than for su» 
jects who merely read the case pei 
Subjects also reported that the ане 4 
discussion was а meaningful exper! 
ence, 

As can be seen from these examples 
the concept meaningfulness has a 
given various operational definition 
This fact calls for much further carefu 
work but is no bar to continued P 
gress. We can expect that meaningfu 
ness, being a stimulus characteristic, 
will continue to have more operatione: 
definitions than will polarization, а 16 
Sponse characteristic. It is very wee 
to make any stimulus more meaningft i 
this is done every time an organism t- 
Conditioned. The most ambitious v 
tempt at a rigorous hypothetico dede 
tive system in psychology (Hull, 1 s 
1952) limits itself to four basic T 
Sponse measurements while pecie 
for an infinite variety of stimulus mea 
urements, 


s hich 
Various operations, many of wl 
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are discussed by Underwood and 
nos (1960) and Osgood et al. 
oo, have been used to define cul- 
or consensual meaningfulness. 

Nus studies using personal or idio- 

phic meaningfulness as an inde- 
аа variable have followed the 
= ax of Allport (1953, 1961) 
ak elly (1955). Allport advises 
Ue xperimenter to ask the subject. 
(1908 this technique, O’Donovan 
е н А greater polarization for 
fl ^s atements checked as meaning- 

| than for those not so checked. 
с asserts that а person’s own di- 
lineis n (constructs) are more mean- 

8101 to him than are those constructed 
Du Persons, Isaacson (1962) 
im greater polarization for subjects' 
fer ше than for those derived 
O utler and Haigh (1954) or 

jo et al. (1957). 
ens ad evidence indicates that sub- 
dite iffer greatly in what they con- 
Eod meaningful. They even differ 
Quem in what they mean by the word 
ily - ingful" Yet they respond read- 
iud hen asked to rate meaningfulness 
amm ени as if they know what they 
ке loing. That is, this task seems 

aningful, or makes sense, to them. 
had foregoing is not meant to predict 
E. Seeds operational definitions 

E 2. ningfulness will be found to refer 
the фа, underlying mechanisms in 
lane ividual subject, but to predict 
ingful operational definitions of mean- 
tine used in the studies men- 
ж will be found to correlate posi- 

€y with polarization. 

e 15 possible to argue that a person 
Un the victim of pathological 
е. d as evidenced by dichotomous 
te when concepts or scales 
iene ан to him are used. But it 
meani equally plausible to speak of 
M A ningfulness leading to strong com- 
in Ment. A pattern of intense yeasay- 

€ and naysaying may be most appro- 
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priate in the most meaningful situa- 
tions. Mild affect and tolerance for 
ambiguity may be out of place when 
being pursued by a bear or pursuing 
the girl one loves. 


POLARIZATION IN THE LABORATORY 


The laboratory situation is here con- 
sidered a special case of the meaning- 
ful situation. This assertion will not 
be defended here. Several recent dis- 
cussions (Farber, 1963; O'Donovan, 
1963 ; Orne, 1962; Riecken, 1962) sug- 
gest that the subject may find meaning 
even when none is intended by the ex- 
perimenter. If it is agreed that the 
laboratory experience is meaningful 
and is more meaningful when meaning- 
ful stimuli and/or scales are used, then 
the following relationships between 
polarization and measures of retention 
and reliability fit into the general rubric 
of meaningfulness. 

Heretofore neglected in the litera- 
ture on extreme-response tendencies is 
evidence relating what looks like po- 
larization to effective learning, psycho- 
physical, and psychometric behavior. 
A thorough review of these studies 
would cover virtually the history of 
psychology, at least from Yerkes and 
Dodson (1908). Rapaport (1942) 
summarizes many studies relating in- 
tensity of emotional response to reten- 
tion. Koch (1930) reports a curvi- 
linear relationship between rated 
pleasantness of examination grades 
and later recall of these grades, with 
grades rated "indifferent" being most 
likely forgotten. Postman and Murphy 
i3) found that word pairs either 


(1943) 1 
strongly compatible or strongly in- 
compatible with subjects’ attitudes 

ter than word pairs 


were retained bet 1 
only mildly compatible or incompatible. 


Pertinent studies focus on meaning- 
fulness and learning. Thorndike 


(1935) reported superior retention of 
“cherished” over “worthless” material. 
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Underwood and Schulz (1960) have 
thoroughly reviewed relationships Ље- 
tween extremity of rating and meaning- 
fulness (particularly pp. 19-25) and 
between meaningfulness and learning. 
Noble (1958) deals directly with emo- 
tionality or affectivity ratings and 
meaningfulness. Osgood et al. (1957, 
pp. 155-159) report a relationship 
between polarization and speed of Te- 
sponse. Earlier, Postman | and Zim- 
merman (1945) had linked intensity of 
attitude to speed of response. А line 
of studies dating at least to Henman 
(1911) reports similar findings. Sla- 
mecka (1963) relates speed of Tesponse 
to stimulus meaningfulness. In short, 
there is evidence linking stimulus 
meaningfulness, speed of response, and 
polarization. 

It may be that polarization, far from 
being considered pathological, 


is now 
taken for granted in the learning lab- 
oratory. For instance, Mechanic 


(1962) writes: “Tt m 
sumed that the high-m 
will evoke strong differ 
at the outset, while the 
items will not readily 
sponses [p. 594] » 

hat is meaningful w 
enters the laboratory is 
prior commitments, Three studies us- 


ing the Allport-Vernon (1931) or АП- 
port-Vernon-Lindzey (1951) Study of 
Values illustrate this point and its ef- 
measures, 


ay also be as- 
eaningful items 
ential responses 
low-meaningful 
evoke such re- 


hen the subject 
related to his 


economic meanin 


finding for subjects with strong poli- 
tico-economic values. Vaughan and 
Mangan ( 1963) found greater resist- 
ance to group pressure for religious 
subjects when religious questions are 
discussed, and for subjects with strong 
economic values, when economic ques- 


tions are discussed. Postman, m 
and McGinnies (1948) found that а 
jects more quickly recognized wor 
related to their stronger values. " 
Some studies in social aber ee 
or person perception, such as ped. 
Sherif and Hovland (1961), direc A 
relate polarization with о иаи ч 
Polarization increases with ee 
volvement of the subject (Hovlan 
Sherif, 1952; Manis, 1960; Shy a 
Allport, & Barnett, 1958), with the uli 
troduction of rewards or valued pen : 
to an ordinary jsychaptsstéu o а 
ment task (Tajfel, 1959; Tajfe РЫ 
Cawasjee, 1959), and with wo om 
relevance of rating categories e) 
torf, Richardson, & Dornbush, 19 m 
Tajfel and Wilkes (1964) main ius 
that "knowing what is important to nd 
rater may enable us to predict how e 
when he will tend to use his more € 


: heir 
treme judgments [p. 48]." T 
conclusion was: 


ich 
Attributes which appear early and S 
are repeated frequently in free чес ee 
of other people tend to be assigned He have 
treme ratings than attributes pen to be 
low frequency and priority and ten thai 
judged as more important in a person 
low ranking attributes [p. 47]. 


It has long been observed that aee 
Scales and stimuli which do not oe 
Courage polarization have poorer idy- 
liability. Block (1957, р. 359), ae 
ing the phenomenology of магт 
notes that subjects tended to empha afe 
middle intervals in rating peciam to 
fects, such as nostalgia, in on 
more meaningful or vivid affects, eee 
as love, thus bringing about d zm 
liabilities. Luria (in Osgood LATET. 
1957, p. 250) found a M o amis 
:81 between polarization and bres о 
reliability. A direct empirical к 28 
meaningfulness, polarization, "yn and 
liability was attempted by Smit ome? 
Kendall (1963). Tn а technique 1955), 
what similar to that of Kelly ( 
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e were evaluated in the nurses’ 
in! а за In contrast to the 
inge е ings with supervisors’ rat- 
d М extremes of these scales were 
(1963) raters. Smith and Kendall 
iet А ее that “dimensions 
е g ul to the raters" were the chief 
ны in obtaining evaluative rating- 
ale reliabilities above .97. 


(ат 
OMPARISON OF DIFFERENT STUDIES 


a ate certain obvious differ- 
linkin etween those studies apparently 
thoss g polarization with pathology and 
with А. orae d linking polarization 
DU Mee The pathology- 
by mci hypotheses are supported 
Beim les E which rating dimensions 
"Aes posed upon the subject. He has 
E isa as to which dimensions he 
sah r which stimuli he rates. We 
ж ге that the stimuli and dimensions 
When n personal meaningfulness, since 
that thee do ask the subjects, they say 
Жору d do. We further assume that 
bul ration for responses of varying 
arization is provided, to use Hull- 
Ter» terms, by a mixture of relevant 
in Irrelevant drives and the increase 
— vig drive level concomitant 
tiene and neuroticism (Hull, 
На: Janet T. Spence, 1963; К. W. 
pow 1960; Taylor & Spence, 1952). 
de kis other hand, the meaningful- 
poste: р hypotheses are sup- 
Bs y studies in which the subject 
Dtm an opportunity to provide his 
C personally meaningful dimensions 
genres & Caldwell, 1962; Isaacson. 
дк choose which dimensions or 
1961. are more meaningful (Mitsos, 
REM O'Donovan, 1964), or take part 
m. vitat he or others believe to be a 
1964 meaningful experience (Saper, 
; also laboratory references). 

This brings us back to the central 
question of this discussion : Under what 
Conditions is degree of polarization 
More predictable from stereotypic and 
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perhaps pathological response style, and 
under what conditions is degree of 
polarization more predictable from 
meaningfulness of the stimuli being 
rated and/or the rating scales being 
used ? 

Most of the studies cited have meas- 
ured or categorized only the meaning- 
fulness of the stimuli or the personality 
classification (e.g., normal versus dis- 
turbed) of the subjects. What is 
needed is a set of propositions leading 
to predictions of the interaction of 
stimulus meaningfulness and personal- 
ity classification. The following prop- 
ositions are suggested. 


Basic PROPOSITIONS 


1. Inappropriate or pathological 
rigidity in an organism is best studied 
by measuring lack of differentiation be- 
tween responses in two or more func- 
tionally dissimilar situations, to use 
Dollard and Miller’s (1950) term. In 
this discussion situations differing in 
meaningfulness have been stressed. 
However, this proposition does not de- 
pend on any of the other propositions. 
Even should the present propositions 
concerning meaningfulness and polari- 
zation not prove useful, the reader 
should still be wary of using one set 
of responses to functionally similar 
stimuli as an operational definition 
of pathological rigidity. 

2. The extremeness of an organism's 
response will depend to some extent on 
the meaningfulness of the stimuli. If 
we arbitrarily dichotomize the variable 
of meaningfulness, this proposition 
leads to the prediction that response 
to meaningful stimuli will tend toward 
the extreme (polarize). while response 
to meaningless stimuli will tend toward 
the indifferent (depolarize). 

3. The extent to which degree of 


polarization depends upon the mean- 
ingfulness of stimuli is related to other 
personality chara 


cteristics. 
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4. Effective behavior and lack g 
emotional disturbance are associate 
with selective use of extreme responses. 
The more effective and/or less dis- 
turbed the individual, the more prob- 
able that Proposition 2 will predict his 

ehavior. р 
xit —— behavior and emotional 
disturbance are associated with less dis- 
criminate use of extreme responses, 
with less differentiation between mean- 
ingful and meaningless stimuli. In the 
event of total collapse of effective be- 
havior patterns, the predictive power 
of Proposition 2 will also collapse. . 

6. As Proposition 2 loses predictive 
power, propositions based on the indi- 
vidual's usual response style gain pre- 
dictive power. 

7. Predictions can also be based on 
personality classifications. 


Two ex- 
amples follow. 


SPECIFIC Propositions ron. Groups 
DEFINED By PATHOLOGY 


8. Neurotics and psychosomatic pa- 
tients polarize both meaningful and 
meaningless stimuli. 

9. Schizophrenics an 
sons depolarize both 
meaningless stimuli, 

An example of results postdicted by 
these propositions is found in Linde- 
man and Adams (1963). Normals 
rated simple light flashes in a different 
style than they rated abstract designs, 
while schizophrenics did not. "Their 
interpretation Was that "the smaller 
number of Tesponse differences found 
in the Schizophrenic Sroup would seem 
to indicate some basic factor common 
to the schizophrenic process, 
the data are not conclusive it is Possible 
that the small number 


different responses is the result of poor 
discrimination or lack of attention fre. 
quency found in schizophrenic patients 


[p.77]. 
Two studies dealing with meaning- 


d schizoid per- 
meaningful and 
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fulness and retention, but not explicitly 
with polarization, showed results con- 
sistent with these propositions. Sher 
man (1957) discovered that mex 
prison subjects retained meaningfu 
material—but not nonsense Зан 
better than did neurotic subjects. Ni- 
dorf (1964) used the serial-anticipation 
method to test learning of ers ri 
syllable lists differing in vedo d 
ness. Increased meaningfulness I 
tated the learning of both normals an 

schizophrenics, but facilitated the ird 
ing of normals significantly more tha 

it did the learning of qe nie p 
Neither of these studies suggested De 
mechanism responsible for these aiet 
ences. If we follow the ory, e 
the propositions and suggest Fi w^ 
Mechanic’s (1962) assumption и 
the high-meaningful items will br 
strong differential responses at the ou : 
set [p. 594]" is at least a parallel E 
polarization, both of these finding 
would be expected. ss 

These propositions are also pur 
tent with the results both of eid 
Cited in support of the pathology jan 
pothesis and of those cited in suppr 
of the meaningfulness пафасе 
Where meaningfulness is not Sat 
ated, these propositions predict en 
neurotics will polarize (Neuringer, 
1961; Wertheimer & Меке, 
2), and schizophrenics ванна 
(Bopp, 1955; Morris et al, 1960; 
O'Donovan, 1960). Where gel 
fulness is evaluated, meaningful «oil 
uli and/or rating scales lead to a 
tion (Cromwell & Caldwell. ‚1 E 
Isaacson, 1962; Mitsos, 1961; O'Don 
van, 1964; Sa er, 1964). 

For the effective individual (or d 
the normal, if that is the best Ms 
Tation available), these pruposta 
predict a response whose vigor she 
relates with the meaningfulness О oe 
stimuli. For the disturbed or eie 
fective individual, predictions m 
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иа of the person’s usual 
fee style or his personality classi- 
Sheet will be more powerful than 
the ions made from knowledge of 
meaningfulness of the stimulus. 


IMPLICATIONS FOR FURTHER 
RESEARCH 


ae sophisticated techniques of 
(Jenkins. stimulus meaningfulness 
& font Russell, & Suci, 1958; Staats 
1964: Pe е a polarization (Blum, 
Боры еа оду, 1962), and their rela- 
er. i M enkins, 1960; Koen, 1962; 
| p 63) are currently available. 
signif stance, Amster (1964) reports 
eis cant effects of contextual pleas- 
i on polarization and recall. Zil- 
Sett : ion and De Cencio (1964) pre- 
Son н епсе of the effect on polariza- 
itin Аня interaction between е 
en a the rater and the instructions 
is him, These new techniques and, 
E dee the findings and more 
eae д eveloped methods of the verbal 
© kE and verbal behavior field need 
studi applied to rating extremity 
Pire It is clear that whether polari- 
has n is considered “sick” or “healthy” 
He cene d depended on whether the 
[eias ar study focused on stimulus 
tics rsen or subject characteris- 
ium ocusing on the interaction be- 
ties 1 stimulus and subject characteris- 
snd may provide. considerable clarity 
ins even a reconciliation of the studies 
ntioned. 
ao important questions in this 
ec may only be answered by studies 
tai ending over time. The most cer 
К 1 test of whether a person’s cognitive 
ш is rigid or allows change is the 
ow Dost facto measurement of change. 
Such studies are now being conducted 
at the University of Missouri Mental 
ygiene Clinic (Landfield. Nawas. & 
Donovan, 1962). Results to date 


(Ourth, 1963) conflict with the hy- 
is a reliable 


Pothesis that polarization 15 
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measure of pathology and suggest that 
polarizers may be more capable of im- 
provement in psychotherapy than de- 
polarizers. The rating scales used in 
these studies were derived from the di- 
mensions of personality description 
used by the individual clients them- 
selves before entering psychotherapy 
and by their assigned psychotherapists. 
Following the logic of the propositions, 
we would expect that those who find 
their own dimensions and those of their 
therapists meaningful will polarize, and 
will later find their psychotherapy more 
meaningful and hence more successful. 

Landfield (1964) had clients in psy- 
chotherapy rank in order of "felt use- 
fulness in describing others" the dimen- 
sions which they themselves and their 
therapists had used to describe signifi- 
cant persons. Clients then rated them- 
selves on these dimensions. It was 
discovered that (а) self-ratings on 
clients’ own dimensions were more 


polarized, and (b) ratings on dimen- 
anked as more useful were more 


sions ri 
polarized. 
Further research based on these 
propositions might investigate whether 
other positive 


successful therapy ОГ 
experience leads to increased differen- 


tiation in degree of polarization to 
meaningful and meaningless stimuli. 
As an individual becomes more au- 
tonomous OT self-actualizing. his re- 
sponses may more consistently follow 
Proposition 2. He may also have a 
clearer, more conscious notion of what 
is meaningful to him. The ability to 
state to oneself and to others what 15 
meaningful and to respond accordingly 
(as predicted by Proposition 2) may 
emerge aS à psychological model of 


human freedom. 
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INFORMATION ABOUT SPATIAL LOCATION 
BASED ON KNOWLEDGE ABOUT 
EFFERENCE' 


LEON FESTINGER лхо LANCE KIRKPATRICK CANON 


Stanford University 


An experiment was designed to determine whether or not the human 
organism possessed "outflow" information derived from monitoring 


nerve impulses in motor pathways. 


The experiment focused on the 


extraocular muscles since proprioceptive input to the central nervous 


system from these muscles is poor. 


The results show that in the 


absence of good proprioceptive information, the presence or absence of 
outflow" information makes a difference in accuracy of localizing an 


object in space. 


The human being continually ac- 
quires and uses information about him- 
Self and his relation to the environ- 
We We are accustomed to thinking 
Fs his information as having been 
a through input to afferent 
the nanisms, That is, we know about 
^s environment through seeing, hear- 
pi touching, and a variety of other 
e or Not the least of these sources 
У nformation is input from proprio- 
i For example, if I am led 

lindfolded into a room and I touch 
ап object in that room with my hand, 
i know where that object is in relation 
9 my body because, on the basis of 
Proprioceptive feedback, I know where 
my hand is. 
Bip is, however, another possible 
Hon ce of information about one's rela- 
b to the environment that has not 
ееп adequately explored. If, in the 
Central nervous system, outgoing motor 
nerve impulses are monitored and re- 
Corded, then information would also 
exist concerning spatial location on the 
asis of this record of efferent impulses, 
NA This research was supported by Grant 
air MH 07835-01 from the National Insti- 

ites of Health to the senior author. We 
Wish to thank Douglas H. Lawrence and 

Ordon H. Bower for their help on the ex- 
Periment. 
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that is, a record of the specific direc- 
tions given to the musculature. This 
information, if it exists, need not rely 
on any current afferent input. To 
make this clear, let us illustrate by a 
loose analogy. Imagine there is a per- 
son who will unconditionally obey your 
orders. Let us also assume that you 
and the other person have had suffi- 
cient previous experience with the en- 
vironment so that you can give him, 
and he can follow, clear directions. 
You tell this person to go to a certain 
specific place and to wait there for you. 
Even in the complete absence of any 
current sensory input you will know 
exactly where that person is because 
you know where you told him to go. 
The question of whether or not such 
monitored efferent information exists is 
an old one in psychology although, of 
late, it has been rarely mentioned. 
Actually, a closely related speculation 
was vigorously debated many years 
ago. James (1950)? stated the issue 


clearly: 


There must, of cour: 
of energy going out 

2 We give the dates o 
from which we have quoted. The book by 
James was originally published in 1890, and 
the first edition of the book by Helmholtz 
was earlier than that. 


se, be a special current 
from the brain into the 


{ the later editions 
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i A his 
riate muscles during the act; and t 

Mode m current (it is supposed) must have 
in each particular case a feeling sui genre 
attached to it, . . . . This feeling о eei 
current of outgoing energy has Mise 
from Wundt the name of the feeling of 
innervation. I disbelieve in its existence, 
and must proceed to criticise the notion of 
it, at what I fear may to some prove 
tedious length [p. 493]. 


If in this statement we replace the 
phrase “feeling of" by “information 
about,” then this old controversy is 
exactly germane to our present ques- 
tion. While we do not intend to en- 
gage in an exhaustive review of the 
argument about "feeling of innerva- 
tion," let us look at the principal data 


about which the disagreement cen- 
tered. 


One major piece of evidence at that 


time is summarized by Helmholtz 
(1925). He states: 


For instance, if the external 


right eye is paralyzed or the 
to it, this eye 
to the right, A 
tinues to turn it in 
regular movements 
rectly the directions 
of view. But the mor 


rectus of the 


he objects appear to move 
У although the adjustment of the 


S of the retinal images 
in it have not varied [p. 245]. 

From this Helmholtz concludes that 
Since there was absolutely no afferent 
change when the eye tried to move to 
the right, and since motion was per- 
ceived as if the eye had moved to the 
right with the retinal ima: 


Бе remaining 
constant, there must be a feeling of 


(information about) innervation, 


William James (1950) quotes other 
data in addition. He says: 


Partial paralysis of the same muscle, paresis, 
as it has been called, seems to point even 
more conclusively to the same inference, that 
the will to innervate is felt independently of 
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all its afferent results. I will quote ri Ve 
count given by a recent authority, 0: ме 
effects of this accident: “When d 
going to an eye muscle, eg, the Er of 
rectus of one side, falls. into a stai ub 
paresis, the first result is that the ps 
volitional stimulus, which under norm aa 
cumstances would have perhaps rotate! ae 
eye to its extreme position outwards, ards 
competent to effect only a moderate ү ice 
rotation, say of 20 degrees. If now, 5 DTE 
the sound eye, the patient looks at an xu Нс 
situated just so far outwards from the den ш 
eye that this latter must turn 20 сат ‘eel 
order to see it distinctly, the patient wi а 
as if һе had moved it not only 20 йге 
toward the side, but into its extreme n 
Position, . . . . The test proposed by йез 
Graefe [1878], of localization by the Sap 
Of touch, serves to render evident the biu 
which the patient now makes. lí we А aa 
him to touch rapidly the object looke aie 
With the fore-finger of the hand of the Sel: 
side, the line through which the finger D de- 
Will not be the line of sight directed 2 eum 
grees outward, but will approach more ne 2 
to the extreme possible outward line 
vision [p. 507]." 


. . b- 
The theoretical relevance of this р 
servation is stated succinctly by Jan 


It appears as if here the judgment наа 
tion could only arise from the ae is 
nervation of the rectus when the objet e 
looked at. All the afferent feelings mus the 
identical with those experienced when rect. 
eye is sound and the judgment is corr the 
The eyeball is rotated just 20 degrees Tls on 
One case as in the other, the image fa s on 
the same part of the retina, the pressure Pe 
the eyeball and the tensions of the skin one 
conjunctiva are identical. There is only our 
feeling that cag vary, and lead us © ток 
mistake. That feeling must be the ТЫ 
which the will makes, moderate in one S ain 
excessive in the other, but in both са 8]. 
efferent feeling, pure and simple [p. 


the 

James then proceeds to ke oe 

interpretations of these observat! was 
Acknowledging that С. E. Müller 


ex- 
the first to propose the rebuttal 
planation, he states : 


Р 15 
Beautiful and clear as this reasoning тогу 
to be, it is based on an incomplete gene a 
of the afferent data. The writers epis the 
Omitted to consider what is going on І 
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other eye. This is kept covered during the 
experiments, to prevent double images, and 
other complications. But if its condition 
under these circumstances be examined, it 
will be found to present certain changes 
which must result in strong afferent feel- 
ings. And the taking account of these feel- 
ings demolishes in an instant all the conclu- 
sions which the authors from whom I have 
w base upon their supposed absence [p. 


James then proceeds to point out 
that the covered, healthy eye does ro- 
tate as directed by the efferent im- 
pulses and thereby provides the affer- 
ent stimulation necessary for the per- 
ception of motion in the Helmholtz 
(1925) example, or the misperception 
of direction in the Graefe (1878) ex- 
ample. Although James, in his ex- 
planation, never copes with the ques- 
tion of why the afferent impulses from 
the covered eye should completely 
dominate the afferent impulses from the 
Open eye, nevertheless his “demolition” 
of the argument for feeling of (infor- 
mation about) innervation appears to 
have been very effective. So per- 
Suasive was the argument by James 
that Mach (1914), who in 1886 had 
argued strongly for the “feeling of in- 
nervation” and presented original ex- 
Periments supporting it, almost com- 
pletely reversed his stand in the fifth 
edition of his book, written in 1906. 
Here he says: 


е theory of James and Miinsterberg fits 
; езе facts, as I think, without апу strain- 

—ng, and we ought therefore to consider it 
as Correct in essentials. The innervation is 
Not felt, but the consequences of the inner- 
Vation set up new peripheral sensible stimuli, 
Which are connected with the execution of 
the movement [p. 176]. 


Rightly or wrongly, James appar- 
ently won the argument, and the issue 
has been a dead one in psychology for 
Many years. Many dead issues do not 
Stay dead, however, and this one has 
Tecently been revived by physiologists. 

ecently von Holst (1954), concern- 
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ing himself with how the organism 
differentiates between self-generated 
movement of a part of the body and 
an identical movement generated by 
external forces, proposed the idea of 
"efference copy.” His idea was that 
incoming afferent signals were matched 
against a temporary copy of outgoing 
efferent signals. If they matched per- 
fectly, the motion involved was en- 
tirely self-generated. This, of course, 
is somewhat different from the idea 
that information from a record of ef- 
ferent impulses is available and used all 
by itself. Nevertheless, it is related 
and served to revive the issue in other 
contexts. 

The question has become particularly 
important to those who are concerned 
with understanding the control system 
for eye movements. Probably the ma- 
jor reason for this is that there is great 
doubt among physiologists that afferent 
signals from the extraocular muscles 
are used to any significant extent in 
the control of eye movements. If 
afferent feedback from the extraocular 
muscles is not useful for determining 
the position of the eye, then it becomes 
convenient for the theoretician to posit 
the existence of information obtained 
from a record of efferent impulses. 

Thus, Fender (1964), discussing the 

possible role of afferent signals of posi- 
tion of the eye, says: 
There is experimental evidence that the 
positioned signal is not used, for if a subject 
views two similar but separately generated 
stabilized images, one with each eye, it 1s 
found that for a short period the two visual 
axes move in conjunction. However, this 
motion quickly breaks down, and the visual 
axes move independently, sometimes getting 
as far apart as 30 deg in the horizontal 
direction and 15 deg in the vertical. There 
is, of course, no binocular retinal-image dis- 
parity to act as a cue in this case, and it 
appears that any positional signal which 
might arise from the extraocular muscles is 
quite ineffective in maintaining the paral- 
lelism of the visual axes [p. 317]. 
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Fender proceeds to incorporate an 
“efferent copy” feedback loop into his 
model of the physiological system con- 
trolling eye movements. 

Whitteridge (1962) recently sum- 
marized the problem as follows: 


The role of extraocular afferent impulses in 
perception is very uncertain. It is self 
evident that we are not directly aware of 
the position of our eyes in the same sense 
in which we are aware of the position of 
our fingers even with the eyes shut. The 
question is whether the position of the eyes 
enters into judgments of position and move- 
ment, and if it does, how far proprioceptors 
are responsible. The alternative theories are 
that information from the volume of out- 
going motor nerve impulses in the oculomotor 
pathways is centrally available—the outflow 
theory, or that impulses from Proprioceptors 
directly signal the state of the eye muscles— 
the inflow theory. The strongest point 
against the inflow theory is that when a pa- 
tient with a paralyzed and therefore immo- 
bile eye tries to turn it to one side, the 
observed visual field moves as though he 
had succeeded in moving the eye, This 
cannot be due to any conceivable change in 
proprioceptive discharge [p. 511]. 


As of 1962, among physiologists, the 
entire controversy seems to have re- 
vived. The issue is now more sophis- 
ticated from a theoretical point of view ; 
but on the empirical side, Whitteridge 
(1962) seems to be back to Helmholtz 
(1925). There is, however, more em- 
Pirical evidence on the issue today 
than there was 60 to 70 years ago. 
Brindley and Merton (1960) report a 
very direct attempt to settle the ques- 
tion as to whether or not there is usable 
proprioceptive feedback from the ex- 
traocular muscles. They anesthetized 
the surface of the eyes and the inner 
surface of the eyelids of Subjects and 
covered the corneas with Opaque caps 
so that the subjects received no visual 
information. They then mechanically 
moved a subject's eyeball by catching 
hold of the insertion of either the 
medial or lateral rectus muscle with 
toothed forceps. When the eye was 


moved in this manner through rota- 
tions of 20 degrees or more, sometimes 
even backward and forward quite 
rapidly, the subject did not know that 
his eye was moving. 

C etn of the argument offered 
by James, they repeated these observa- 
tions moving both eyes simultaneously 
and obtained the same result. Mer- 
ton's (1964) paper, the main purpose 
of which is ©, to reinstate the 
experiments of Helmholtz, which 
proved that no information about the 
position of the eyes is derived from 
Sense endings in the eye muscles [р. 
315], comes to the conclusion : A 
Subject is only conscious of his inten- 
tion to move his eye and does not know 
whether the movement has in fact taken 
Place or not [p. 318].” 

Considering these new data, it seems 
highly likely that Helmholtz (1925) 
Was correct and that James (1950), in 
Spite of having won the argument in 
his day, was wrong. It would be use- 
ful, however, to have additional data 
on the question. After all, the work 0 
Brindley and Merton (1960) demon- 
Strates the absence of a position sense 
in the eye based solely on propriocep- 
tion from the extraocular muscles. „То 
Strengthen the argument опе might 
well desire Positive evidence that infor- 
mation obtained from a record of out- 
going motor nerve impulses is avail- 
able and useful. 

Let us be Specific. If it is true, 45 
seems likely, that we know the position 
of the eye mainly in terms of knowing 
where the eye was directed to go, then 
it should be Possible to show that when 
the eye is directed to go to a specific 
location, a subject knows where his eye 
15 More accurately than if the eye E 
Пуе at the same position withou 
directions Concerning this specific loca- 
tion ever having been issued. h 

The technical problem in doing suc 
an experiment is, of course, the prob- 
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lem of how to devise a method of hav- 
mg the subject move his eyes to a 
Specific location without issuing effer- 
ent signals concerning that location. 
A plausible solution to this technical 
problem may be found in the work of 
Rashbass (1961). He reports a series 
of experiments designed to elucidate 
the relationship between the usual 
Saccadic eye movements and the smooth 
eye movements that occur in tracking 
a target. Several of his findings are 
Important to us here. 

Rashbass reports evidence that sac- 
cadic eye movements and smooth 
tracking eye movements are controlled 
and generated independently of one 
another. Barbiturate drugs serve to 
almost completely disrupt smooth eye- 
tracking movements but do not inter- 
fere with precise saccadic movements. 
Thus, a subject who watched a target 
Which moved horizontally at a rate 
of 3.5 degrees per second ordinarily 
Showed a smooth eye movement before 
the administration of any drug. Aíter 
administration of a barbiturate, Rash- 


bass (1961) states: 


The first noticeable effect was the increase 
in the number of saccadic movements occur- 
ring during the first second of tracking. As 
the amount of drug given increased, the 
Saccadic movements increased at the expense 
of the smooth movements, until, after 8 min- 
Utes, no smooth tracking movement could be 
detected [pp. 333-334]. 


From this and other data, he con- 
cludes that barbiturate drugs make the 
Smooth tracking response inoperative 
but do not interfere with accurate sac- 
Cadic eye movements. Hence the two 
types of eye movements must be sepa- 
Tately controlled. 

Rashbass also reports data from ex- 
Periments designed to discover what 
Produces smooth and saccadic eye 
Movements. The specific question is 

Whether smooth movements аге 
brought about by the position of the 


target’s image on the retina, or by its 
movement over the retina [p. 331].” 
He tests this “by imparting to an ini- 
tially stationary target a displacement 
to one side, and at the same time be- 
ginning a movement of uniform veloc- 
ity toward the opposite side [p. 
331].” The result is stated by Rash- 
bass as follows: 


. after a reaction time during which 
the eye does not move, a smooth movement 
starts in the direction in which the target is 
moving. When this has been established, 
a saccadic movement occurs in the direction 
opposite to the smooth movement to counter- 
act the lead which the eye has over the 
target. . . . This result indicates that the 
smooth movement is stimulated by the move- 
ment of the target irrespective of its position. 
The conclusion that the smooth movements 
are brought about by the movement of the 
target explains the apparently paradoxical 
observation that the first movement which 
the eye makes may take the point of fixa- 
tion further from the target than if no eye 
movement at all were to occur [p. 332]. 


From this and other data, the con- 
clusion is that "the smooth movement 
is stimulated by the direction of move- 
ment and the velocity of the target, 
and the saccadic movement is stimu- 
lated independently by the position of 
the target [p. 333].” 

We have dealt at length with the re- 
sults obtained by Rashbass because 
they are critical for us. They suggest 
that if the eye were brought into a 
given position bya saccadic movement, 
this movement would be a response to 
efferent signals concerning the position 
of the target. ]f, however, the eye 
were brought into that same position 
by a smooth tracking movement, the 
efferent directions would be concerned 
with velocity matching and not pre- 
cisely with target location. И 

А possible experiment suggests itself 
to answer the question concerning the 
availability of information based on 
efference. The experiment would be 
conducted in a completely dark room 
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with the subject’s head fixed so that 
only eye movements could occur. In 
one variation, a light would suddenly 
appear within the visual field of the 
subject, then disappear, and the sub- 
ject would be asked to point to the lo- 
cation where the light had been. In 
this variation, in order to fixate the 
light, a saccadic eye movement would 
occur, and “directions” would have 
been given to the extraocular muscles 
to move from “normal frontal” posi- 
tion to a specific location. If these 
directions to the musculature are moni- 
tored and recorded so as to be avail- 
able as information, the person would 
know the location of the light on the 
basis of knowing where he had sent his 
musculature in order to fixate the light. 
In another experimental variation, 
the light would appear and move 
slowly and smoothly across the visual 
field before coming to a Stop. The 
subject would fixate the light when it 
first appeared and would then track 
the light across the visual field until 
it stopped moving. To the extent that 
only smooth tracking eye movements 
occurred, the musculature would, pre- 
sumably, simply have been directed to 
"follow the light." Thus, in this ex- 
perimental variation, the efferent in- 
formation that existed would contain 
information about the direction of 
movement and the velocity of move- 
ment, but would not include informa- 
tion concerning the specifically desig- 
nated position in which the light had 
stopped. 


In both of the above vari 


ations, of 
course, there would be the same 
amount of proprioceptive information 


concerning where the light was, If the 
subject’s head is clamped so that only 
eye movements are used to fixate the 
target, the only proprioceptive signals 
would come from the extraocular 
muscles. Since these signals are not 
useful, as Merton (1964) has shown, 
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then subjects would know the location 
of the light more accurately when it 
suddenly appeared than when they 
tracked it across the visual field. We 
would, of course, expect more than zero 
knowledge of location in the tracking 
condition. The subject would have 
knowledge about direction and also 
some knowledge of eye position from 
afference from the eyelids. Also, it is 
well known that smooth tracking move- 
ments lag and saccadic movements oc- 
cur periodically. These would also 
provide additional information. If. 
however, information based on effer- 
ence is available, we would expect а 
difference between the two conditions. 
Along with this, of course, one ime 
want to set up another experimenta 
condition in which the subject’s head 
was not clamped so that head move- 
ments could be employed in helping to 
fixate the light. Useful proprioceptive 
input would be expected from the neck 
muscles, and to the extent that the 
Position of the light could be adequately 
known on the basis of these propriocep- 
tive signals from the neck muscles, the 
difference between the two experi- 
mental variations would be expecte 
to vanish. к 
Such an experimental design, using 
two manners of presentation of the 
light and two degrees of adequacy of 
proprioceptive information, should pro- 
vide data that would answer the ques- 
tion as to whether or not outflow 11- 
formation is available and is used. 


PROCEDURE 


Twenty-eight college students, 12 female 
and 16 male, were subjects in the exper 
ment. Each subject volunteered and was 
Paid $1.50 for participating. ^ 

The experiment was conducted in a light- 
Proof room, 'The apparatus consisted of a 
overhead boom fastened to the ceiling m 
its pivot point slightly in front of a ES 
directly over the subject's head. The ШО 
extended 4 feet forward from the pivot point. 
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From the far end of the boom hung an il- 
luminated rectangle that measured 2X3 
inches, The experimenter, standing to the 
side of the seated subject, could move the 
boom noiselessly so that the light was at any 
desired lateral position. The height of the 
light was fixed approximately at the subject’s 
eye level. Calibration at the pivot point of 
the boom enabled the experimenter to read 
the setting of the light in angular deviation 
from straight ahead of the subject. 

On a table directly in front of the subject 
and at a suitable height was a pointer at- 
tached to a calibrated turntable. The pivot 
of the pointer was directly underneath the 
Pivot of the boom. The subject, when point- 
ing to where the light was, or had been, was 
instructed to lay his index finger along the 
йер and move it so that he pointed in 
Бе proper direction. The measuring scales 
E: hee the boom and the pointer were very 
dimly illuminated and shielded from the sub- 
ject. The illumination was sufficient, how- 
ever, to allow the experimenter to read the 
eales in an otherwise totally dark room. 

he target light was also dimly illuminated 
м that there were no problems with after- 
mages, and the target light did not make 
other things in the room visible. 
ae risen of the subjects, seven male and 
oe female, were used in the “eye-move- 
thee eH d „condition. These subjects had 
A ir head in a rigid clamp throughout the 
ra periment so that fixating and tracking the 
ж light could be done only with eye 
І ovement, The head and body were always 
n the directly forward position. The other 
d subjects, 9 male and 5 female, were 
т їп the “head-movement” condition. 
бв т condition was identical to the other ех- 
the that the head was not clamped. Thus, 
к subjects could and did rotate their 
er and even their bodies to some extent, 
iu dition to moving their eyes in fixating 

tracking the target light. 
efore data collection started, each sub- 
кы Was given practice at using the pointer 
ith the target light at various positions. 
This practice was continued until the sub- 
ject was familiar with the situation and the 
Use of the pointer. The actual data collection 
Consisted of 28 trials, 4 trials at each of 
Positions of the light. The positions 
Used were +30, +20, +10, 0, —10, —20, and 
fe degrees (+ referring to positions to 

Ne subject's right, —, to positions to the 
Subject's feft), For one trial in each post” 
‘on the target light was turned on in that 
Position and stayed on. The subject pointed 
to the light while it was still visible. This 
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was intended to yield a measure of the 
accuracy to be expected with optimal infor- 
mation. For another trial in each of the 
seven positions the light was turned on in 
that position, stayed on for 3 seconds, and 
was then turned off. The subject was asked 
to point, after the light was turned off, to 
where the light had been. In this condition, 
outflow information would presumably be 
available to the subject. When the light 
came on, the subject would have to direct 
a saccadic movement of his eyes to a specific 
location and would, hence, know this loca- 
tion at least with respect to a normal frontal 
reference point. 

On the two remaining trials at each of the 
target-light positions, the light moved across 
part of the visual field. The light would 
appear, move slowly (approximately 10 de- 
grees per second) across the visual field 
through an angle of 15, 20, 25, 30, or 35 de- 
grees, and come to a halt at the desired posi- 
tion. The light then remained on in this 
final position for 3 seconds and was then 
turned off. The subject was asked to point 
to where the light had been after it was 
turned off. For each of the seven positions 
the light moved from right to left on one 
trial and from left to right on the other 
trials. These trials were, of course, in- 
tended to be trials on which outflow informa- 
tion concerning target position would be less 
available to the subject. To the extent that 
smooth tracking eye movements would have 
been involved, directions concerning target 
location would not have occurred. 

The decision to keep the light on its final 
position for 3 seconds before turning it off 
was an arbitrary one. We wanted a period 
of time long enough so that in the tracking 
trials there would be no ambiguity about 
when and where the light had come to a 
stop. On the other hand, we wanted the 
period short enough so as to reduce the like- 
lihood of blinking or moving the eyes to a 
forward position and refixating the light. 
Such eye movements would tend to vitiate 
the procedure. Certainly, in 3 seconds such 
movements can occur, but some compromise 
between allowing this and having an un- 
ambiguous final position was necessary. 

The order of trials was arranged in a 
sequence SO that the target light was never 
in the same position on any two consecutive 
trials, and the four different kinds of trials 
were distributed evenly through the series. 
The same order was used for all subjects. 
After the subject had pointed for a trial, he 
was asked to return his hand to his lap. The 


experimenter then recorded the setting of 
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the pointer and moved the boom to the ar 
propriate position for the next trial. | «i 
interval between trials was approximate rn 
seconds. The subject's hand remained in his 
lap until the experimenter said, “All right, 
now point to where the light is (was). 


RESULTS 


We are interested in the magnitude 
of the error made by the subject in 
pointing to the position of the target 
light. The less adequately the person 
knows the position of his eyes, or his 
head, when fixating the light, the less 
accurate should he be in pointing to its 
location afterwards. The simplest cal- 
culation is, of course, to take the abso- 
lute deviation of the pointer position 
from the target position for each trial. 
Thus, if the target was in position -+20 
and the subject set his pointer to +16, 
this would be an error of 4 degrees, 
Table 1 presents the results from the 
experiment based on this simple cal- 
culation. 

Even a cursory look at the data in 
Table 1 reveals that the obtained data 
are of the kind one would expect if, 
indeed, proprioceptive input from the 
extraocular muscles js Poor and the 
person has available, and uses, outflow 
information. When only eye move- 
ments are allowed, that is, when the 
head was clamped, the error of point- 


TABLE | 


AVERAGE ABSOLUTE ERROR (х DEGREES) 
OF POINTING TO TA RGET LIGHT 


Type of trial 
Light off 
Condition ud 
Lili a 
on i- | Tracked] Tracked 
LEE 3E] 
right left 
Eye move- 
ment only |3.06| 3.54 5.24 5.55 
Head move- 
ment 2.13] 3.92 | 3.69 3.35 


ing when the light suddenly api n 
at the designated position was өшү 
slightly worse than when the light wa 
on while the person was pone 
However, when the subject oe 
the light across the visual field, an 
thus would not have relevant outflow 
information, the error of pointing e 
considerably greater. It is also ge 
that when head movements are a 
lowed, the results are very rr 
The "tracking" trials are then slightly 
superior to the “at position’ E К 
We have presented these data ln 
cause some readers might consider t ^ 
the proper measure to use. We M 
not engage in extended discussion a 
Table 1, however, nor present ут ра 
cal analyses, since it seems to us м 
а more accurate measure should 
used. The absolute error of родо 
is, of course, affected by constant : 
Tors. One subject may consisten y 
point somewhat to the right - ft 
target, another consistently to the d E 
Such constant errors are probably ү 
to coordinating the physical act. i 
pointing with knowledge of ане 5 
and probably should be disregarded |. 
our calculations, Actually, there wa 
an average constant error of poininng 
somewhat to the left of the position i 
the target. Over all types of pues 
this average constant error was 1.6 id 
grees to the left in the , SSE T 
only” condition and .1 degree to: th 
left in the “head-movement” conditor 
The probable reason for the direction 
of the constant error in the s 
ment-only” condition is that, using En 
tight hand, the hand position W 
more comfortable along the p 
Pointer when pointing toward the re 
than when pointing toward the s С 
Apparently, head movements provi z 
enough additional orientation to elim 
nate this constant error. n. 
There is also another source of E 
Stant error in the data, Two types 
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tracking trials, one from the left, one 
from the right, were used because of 
the possibility that the memory of 
where the light had stopped might be 
affected by the direction in which the 
light had moved. This, indeed, turns 
out to be the case. In the tracking 
trials, the subjects tend to point a bit 
more in the direction from which the 
light had come. Thus, in the “eye- 
movement-only” condition the con- 
stant error is —1.2 degrees when the 
light came from the right, but —2.2 
degrees when the light moved from the 
left. Similarly, in the “head-move- 
ment” condition the corresponding 
constant errors are +1.1 and —.6. 
The difference between the two types 
of tracking trials is not quite signifi- 
cant statistically for the “eye-move- 
ment-only” condition (f= 1.44) but 
15 significant at the 2% level for 
the “head-movement” condition (t= 
2.85). 

Clearly, we do not want to have our 
measure of accuracy of pointing con- 
taminated by these various sources of 
constant error. We, therefore, com- 
puted a “corrected absolute error” of 
Pointing by taking into account for 
each subject, for each type of trial, the 
Constant error in the data. Thus, for 
example, a subject may have had a 


TABLE 2 


AVERAGE CORRECTED ABSOLUTE ERROR (IN 
EGREES) or POINTING TO TARGET LIGHT 


Type of trial 
Light off 
Condition 
Light Tracked| Tracked 
on " racket racket 
AT poa from from 
"i right left 
Eye move- " 
ment only | 2.58] 3.11 4.38 | 4.50 
Tead move- 
ment 1.95| 3.64 | 3.11 2.79 
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constant error of 2 degrees to the left 
on the seven trials on which the target 
was tracked in from the left. If this 
subject set his pointer at —24 degrees 
when the target light had actually 
stopped at —20 degrees, his corrected 
absolute error on this trial was 2 de- 
grees. Table 2 presents the data using 
this measure. 

These corrected data show the same 
overall pattern of results as the data 
using the uncorrected absolute error. 
We will discuss these data in detail, 
presenting appropriate statistical anal- 
yses. 


Eye-Movement-Only Condition 


It is clear that when only eye move- 
ments are permitted, localization of the 
target light is better when the light 
suddenly appears at its final position 
than when it is tracked to its final 
position. An analysis of variance 
yields a highly significant F value 
(8.71, df = 3/39) for the variance 
among the means of the different types 
of trials. The variance among sub- 
jects is also significant (F — 2.68, df 
= 13/39). This latter, of course, 
simply means that some subjects are 
consistently more accurate than others 
in pointing to the target light. 

The difference in accuracy between 
the "light-on" and “light-off-at-posi- 
tion" trials is not significant (t= 
1.10). The mean for each is, how- 
ever, significantly different from the 
mean for each of the "tracking" trials, 
the smallest ¢ value being 3.86 between 


the  "at-position" mean and the 
“tracked-from-right” mean. In n 
tted, 


with only eye movements perm! 
pointing to the target when it sud- 
denly appeared at its final position 15 
not materially less accurate than when 
the pointing was done while the light 
was still on. In the tracking condi- 
tions, however, when relevant outflow 
information was presumably not avail- 
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able, accuracy is materially and sig- 
nificantly worse. 


Head-Movement Condition 


When head movements are allowed, 
the data present quite a different pat- 
tern, although significant differences 
still exist among the different types of 
trials. The variance of the means for 
the different types of trials and of the 
means for subjects both yield highly 
significant F values (7.30, df = 3/39; 
and 6.56, df = 13/39) 

With head movements, the accuracy 
of pointing with the light still on is 
Significantly better than each of the 
three conditions in which the pointing 
was done after the light was off. The 
important differences to us, however, 
are between the “at-position” trials 
and the “tracking” trials, Here we 
find that the “at-position” accuracy is 


no longer better, but is actually worse 


than the accuracy of pointing on the 
“tracking” 


trials. The two ; values 
are 201 and 2.19 which, for df = 13, 
are each significant at about the 5% 
level. We had not anticipated this, 
and we are not certain of the reason 
for it. It may simply be that occa- 
sional inattention affected accura 
the “at-position” trials. There was no 
warning of when the light would ap- 
pear. In the “tracking” trials, the pe- 
riod of tracking could minimize the 
effects of any inattention. Tt is clear, 
however, that when head movements 
are allowed, thus making available 
good proprioceptive input concerning 
position, the availability of relevant 
outflow information no longer produces 
greater accuracy, 


cy in 


Comparison of the Two Conditions 


If we compare the accuracy between 
the condition in which only eye move- 
ments were allowed and the condition 
in which head movements were also 
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allowed, we see that in the latter con- 
dition there is a general tendency to 
be more accurate. When the light is 
on while pointing, the average cor- 
rected error decreases from 2.58 to 
1.05, a difference significant at the 
10% level (t= 173, df = 26). The 
data for the "tracking" trials also show 
much less error with head movement 
allowed. The two : values here "€ 
2.11 for "tracking from the right" an 

2.98 for "tracking from the left," sig- 
nificant at the 5% and 1% level re- 
spectively, -—" 

Only for the "light-off-at-position 

trials is there no improvement | or 
the “eye-movement-only” to the kats 
movement” condition. The actual p 
ference is slightly in the opposite di- 
rection but is negligible (2 = .81). In- 
deed, it seems as though the presne 
of relevant outflow information abon 
eye position in the gee erp 
only” condition is just as good as the 
Presence of the same outflow inerme” 
tion plus good proprioceptive input 1 
the “head-movement” condition. It E 
clear also that, when there is good р 
Prioceptive input and no relevant ou 

flow information, as in the tracking 
trials with head movements, pe iem. 
is at least as good as when relevan 
outflow information is also present. 
This would tend to imply that, in this 


"t Я of 
Situation, there is some redundancy 
information, 


Discussion 


The main conclusion we would like 
to draw from the results of the ex- 
Periment is that information based on 
Some kind of record of efferent К” 
pulses (ie, outflow information) 15 
available to the organism. The majo 
Tesult on which we wish to base this 
conclusion is the finding that, Beg 
Only eye movements were ym 
farget localization was more esie oe 
When the target suddenly appearec ? 
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its final position than when it was 
tracked to that position. 

_ Let us review the line of reasoning 
involved in coming to this conclusion. 
. Accuracy of localization of an ob- 
ject in space with respect to one's body 
depends on knowledge of body, head, 
and eyeball position. If the head and 
body are fixed, the only variable is 
position of the eyeball. 

There is evidence that the position 
of the eyeball is not adequately known 
on the basis of proprioceptive signals 
from the extraocular muscles. Hence, 
with head and body in a fixed position, 
accuracy of localizing an object in 
Space would be poor if the only infor- 
mation about eyeball position came 
from such proprioceptive signals. 

_ There is evidence that smooth track- 
ing movements of the eye are con- 
trolled and directed by the direction 
and velocity of movement across the 
Tetina and not by target location. Sac- 
Cadic eye movements, on the other 
hand, are directed on the basis of 
target location on the retina. Hence, 
if a target is fixated by means of a 
Saccadic movement, efferent signals 
relevant to target location would have 
been issued. Ifa target is tracked by 
à smooth eye movement, however, ef- 
ferent signals concerning direction and 
velocity of movement would have 
been issted—information not optimally 
useful for knowing the target location. 
‚ Consequently, if a record of efferent 
Signals js available, localization in 
Space of a target should be better fol- 
lowing fixation by a saccadic eye move- 
ment than following a smooth tracking 
eye movement. Having found this re- 
sult, we regard it as evidence for the 
existence of information based on this 
hypothesized record of efferent signals. 

It is, of course, possible that there 
are alternative interpretations of the 
data we have presented. No such 
Plausible alternatives occur to us, how- 
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ever. It does not, for example, seem 
possible to maintain any alternative in- 
terpretations in terms of confusion in- 
troduced by the tracking procedure, 
since it is clear, in the “head-movement 
condition,” that the tracking procedure, 
in and of itself, does not interfere with 
accuracy. 

Another possible alternative expla- 
nation could be elaborated as follows. 
Presumably, during the period of 
darkness between trials, the subject’s 
eyes revert to some “normal” frontal 
position. Such a normal position is 
probably a reference point for location 
in the visual field, and, presumably, 
directions are issued to the extraocu- 
lar muscles with respect to some such 
reference point. The eyeball then 
moves, in accordance with the effer- 
ent directions, in a saccadic, ballistic 
movement. Under such circumstances 
the initial movement of the eye to 
fixate the target is not a continuously 
controlled movement. Once started it 
proceeds to its destination. The sac- 
cadic movement, hence, must have a 
complete set of directions issued at the 
beginning. 

It thus becomes clear that, in order 
to issue directions that are relatively 
accurate for the initial ballistic move- 
ment of the eye, information as to the 
location in space of the target must 
exist before the directions are issued. 
And indeed, this information must be 
obtained on the basis of the stimula- 
tion of the periphery of the retina 
when, with the eyes in frontal position, 
the target light suddenly appears. It 
is on the basis of this information that 
the initial ballistic eye movement 15 
more or less accurately directed. 

Why, then, is it necessary to say 
that the differences obtained between 
the *eye-movement-only" conditions 
are due to à record of the efferent 1m- 
pulses actually sent out to the muscles? 
Why could we not simply maintain 
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that the information the person has as 
to the location of the target light is 
simply the information on the basis of 
which the efferent directions were 
issued? After all, on the tracking 
trials the subject did not see the target 
light at its final position in peripheral 
vision. The results of the “head- 
movement" conditions rule out this 
explanation of the results. If seeing 
the target light in peripheral vision 
were important, the tracking condi- 
tions should still be inferior even with 
head movements allowed. 

One must admit, however, that in- 
formation based on a record of the ef- 
ferent signals is not likely to be better 
than the information on the basis of 
which those efferent signals were sent. 
Our present data cannot answer ques- 
tions concerning the relation between 
these two things. Our experiment 
does, however, confirm the existence, 
and usefulness, of outflow information, 
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CENTRENCEPHALIC THEORY AND 
INTERHEMISPHERIC TRANSFER 
OF VISUAL HABITS' 


ROBERT THOMPSON 


Louisiana State University 


Recent anatomical and psychological data suggest that visual pattern- 
discrimination habits are mediated by a direct occipito-mesencephalic 


tract, while a simple brightness-discrim 
occipito-pretecto-tegmental pathway. 
By assuming that the me 


terminal endings of the occipito-fugal pat 
in the presence or absence of inter- 


homolateral. 


response, it is possible to expla 


ocular | transfer in split-brain animals. 
extension of Penfield's centrencephalic t| 


ination habit is mediated by an 
Both of these projections are 
mory trace develops at the 
hway subserving the learned 


This scheme represents an 
heory and a defense for the 


possible existence of a subcortically induced memory trace. 


" In recent years, research on the 
.Split-brain" animal has revealed an 
Sn a function of the corpus cal- 
osum in visual discrimination learning 
Myers, 1961; Sperry, 1961). This 
unction has to do with intercommuni- 
[oum between the two hemispheres of 
earning experiences restricted to only 
one side of the brain. More generally, 
the data emerging from split-brain ani- 
pes have been interpreted (Bureš & 

uresová, 1960; Russell & Ochs, 1963) 
as inimical to a centrencephalic theory 
(Penfield, 1954а) and to any other 
heory having as one of its tenets the 
subcortical origin of the memory trace 
(ев, Gastaut, 1958). Myers and 
perry (1958) expressed a similar 
view: 


gliese facts point to cortex (and associated 
thalamic nuclei) as the probable site of 
elaboration of visual sensory input. The 
contribution of brain-stem mechanisms to 
such high-level, “psychic” activity has yet 
to be fully explored. Present evidence, how- 
ever, points toward involvement of brain 
Stem in preparing and maintaining levels of 
activity in the cortex necessary for such 


7-01 from the 


1 Aided by Grant MH-0837 Tie 
th, Unite: 
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high-level functions, rather than toward a 
primary involvement in these functions. Ac- 
cordingly, it is assumed in the following dis- 
cussion that the memory mechanisms under- 
lying the involved pattern discrimination 
responses reside somewhere within the corti- 
cal gray mantle, or at least can be grossly 


disorganized by appropriate cortical insult 
[p. 301-302]. 


In the current paper, some recent 
anatomical and psychological data will 
be discussed in relation to Penfield's 
centrencephalic theory. Taken as a 
whole, these data strengthen Penfield's 
position that occipito-fugal pathways 
terminating within the brain stem are 
functionally significant in visually 
guided behavior. By assuming that 
the memory trace develops at the ter- 
minal endings of these cortico-fugal 
projections, it is possible to incorporate 
the findings of interhemispheric trans- 
fer of visual learning within the frame- 
work of a centrencephalic model of 
brain function. Contrary to current 
views, the absence of transfer of a pat- 
tern discrimination in split-brain ani- 
will be shown to argue more 
for a subcortically induced 
an it does for a cor- 
ry trace. 


mals 
strongly 
memory trace th 
tically induced memo 
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PENFIELD’S CENTRENCEPHALIC 
THEORY 


One of the principal proponents of 
the view that subcortical processes take 
precedence over cortical processes in 
the integration of the total activity of 
the brain is Penfield (1952, 1954a, 
1954c, 1958). According to Penfield, 
there exists within the brain stem (por- 
tions of the diencephalon, mesencepha- 
lon, and probably the metencephalon) 
an ensemble of nuclei and circuits 
which serves to coordinate and inte- 
grate the activities of the cerebral 
hemispheres. This centrally located, 
integrating ensemble is termed the 
“centrencephalic system” (Penfield, 

1952). Functionally, it represents 


a ganglionic area in which that Stream of 
nervous impulses must arise that produces 
voluntary activity, an area in which the 
sensory pathways culminate in neurone cir- 
cuits and in which the information relative 
to past experience is made available, an area 
in which those nervous mechanisms are to be 
found which are prerequisite to the existence 
of intellectual activity and prerequisite to the 
initiation of the patterned stream of efferent 
impulses that produce the planned action of 
the conscious man [Penfield, 1954c, p. 286]. 


"Thus, with Tespect to visually guided 
behavior, Penfield conceives visual im- 
pulses as being relayed to the centren- 
cephalic system once they reach the 
occipital cortex, By virtue of this Sys- 
tem's vast interconnecting circuitry, 
engrams relative to this visual input 
are excited and compared with present 
experience. On the basis of this in- 
formation, the System discharges a par- 
ticular pattern of “volitional” impulses 
to cortical and subcortical motor 
mechanisms, 
The evidence that Penfield cites in 
support of the subcortical origin of the 
central integrating system is largely 
composed of clinical observations. 
Particularly relevant to Penfield’s posi- 
tion are the well-established findings 
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that small interferences with the brain 
stem either by undue pressure, injury, 
or local epileptic discharge abolish con- 
sciousness and purposeful action, while 
extensive cortical excisions leave these 
“psychical” activities intact (Penfield, 
1957). Another source of evidence 
focuses on the manner in which the 
precentral motor gyrus is activated. 
According to Penfield (1954a), it is 
untenable to envisage the motor area 
as being functionally excited by trans- 
cortical pathways since ablation of cor- 
tical tissue surrounding the precentral 
gyrus fails to interfere with the pro- 
duction of skilled movements. The 
only alternative is that the motor cor- 
tex is under control of a ganglionic 
mass residing beneath the cortical gray 
mantle. Consistent with this possi- 
bility are the findings relative to the 
local abolishment of precentral beta 
rhythms during voluntary action (Pen- 
field & Jasper, 1954). 

In his earlier writings, Penfield 
(19545) concluded that the temporal 
cortex was the site of the memory 
trace. This conclusion has since been 
withdrawn in view of the absence of 
major amnestic effects following bi- 
lateral temporal resections in humans 
(Penfield, 1959), While Penfield does 
not explicitly advocate a subcortical 
memory trace, he does give some hint 
of this belief in his “Horowitz” lecture- 


In discussing speech mechanisms, Pen- 
field (1963) stated that: 


This voluntary outflow is not transcortical 
from speech cortex to motor cortex as was 
Sometimes assumed. It is through the ee 
trencephalic system. And the “hook-up” Le 
tween non-verba] concept and word-idea mus 
have been through that system [p. 45]. 


Somer Recent Supportive DATA 


It will be noted that Penfield has not 
Specifically addressed his theory to the 
field of animal learning. It was origi- 
nally adopted as a working hypothesis 
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with specific application to clinical 
neurological cases. One must hasten 
to add, however, that centrencephalic 
theory stands in good stead with recent 
neuroanatomical findings (Russell, 
1961) and receives considerable sup- 
port from animal studies relative to 
visuomotor behavior (Myers, Sperry, 
& McCurdy, 1962). 

Research from our laboratory on the 
albino rat not only points to the exist- 
ence of a centrencephalic mechanism 
underlying visual discrimination learn- 
ing, but sheds some light on possible 
functional pathways linking the oc- 
Cipital cortex with the brain stem 
(Thompson, Rich, & Langer, 1964). 
During the past 8 years, my associ- 
ates and I have been surgically remov- 
ing portions of the brain in previously 
trained rats in an effort to locate those 
nuclei and pathways which maintain a 
brightness-discrimination habit. Dur- 
ing the course of this research, vir- 
tually every part of the tel-, di-, mes-, 
and metencephalon has been examined 
as to its contribution to the perform- 
ance of the discriminative response. 
The only lesions which consistently 
yielded positive effects include the vis- 
Ual cortex, pretectum (anterior region 
of the nucleus posterior thalami of 
Gurdjian, 1927), lateral posterior 
hypothalamus, and ventral mesencepha- 
lon (Thompson, 1963a). That these 
data suggest a centrencephalic mecha- 
nism is indicated by the very nature of 
the anatomical connections existing 
between these critical structures. First 
Of all, there is general agreement that 
the occipital cortex of rodents projects, 
in part, to the nucleus posterior 
(Combs, 1949, 1951; d'Hollander, 
1922). Secondly, the nucleus posterior 
has been shown to send a fiber pro- 
jection to the ventral mesencephalon 
(Krieg, 1947; Papez & Freeman, 
1930). Finally, the lateral-hypothala- 
mic and ventral-mesencephalic areas 


are anatomically interrelated (Crosby 
& Woodburne, 1951; Krieg, 1932) and 
very probably contain the ganglionic 
groups and circuits necessary for the 
integration and execution of locomotor 
behavior (Hinsey, Ranson, & McNat- 
tin, 1930; Keller, 1932; Waller, 1940), 
the defence reaction (Abrahams, Hil- 
ton, & Zbrozyna, 1960; Bard, 1928), 
and other somatic behavior patterns 
(Woods, 1964). Taken as a whole, 
these data suggest that the performance 
of a discriminative response to stimuli 
differing in brightness requires a relay 
of the visual input to the brain stem 
by an occipito-pretecto-mesencephalic 
pathway.* 


CorticoruGcaL PATHWAYS MEDIATING 
PATTERN- AND BRIGHTNESS- 
DISCRIMINATION HABITS 


According to Penfield’s theory, vis- 
ual impulses are channeled to certain 
regions of the brain stem once they 
have reached the occipital cortex. 
While several recognizable pathways 
are available to effect this relay (see 
Meikle & Sprague, 1964), only two 
seem to be functionally significant in 
discrimination learning. One of these 
pathways has already been mentioned. 
It is indirect and extends from the oc- 
cipital cortex to the ventral mesen- 
cephalon by way of the pretectal area. 
The first component of this pathway, 
the occipito-pretectal projection, has 
been identified in all mammals studied 
(Altman, 1962; d’Hollander, 1922; 
Nauta & Bucher, 1954). The second 
component, the so-called “pretecto- 
tegmental tract,” is found in submam- 
malian species (Clark, 1932) as well 


2In a recent paper, Rich and Thompson 
(1965) outlined a centrencephalic mecha- 
nism underlying the performance of a condi- 
tioned avoidance response. Unlike the con- 
ventional visual discrimination habit, the 
avoidance habit is highly dependent upon the 
limbic system (Thompson et al, 1964). 
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as in the opossum (Bodian, 1940; 
Tsai, 1925), the armadillo (Papez, 
1932), the rat (Krieg, 1947; Papez & 
Freeman, 1930), the cat (Bucher & 
Birgi, 1952), the monkey (Clark, 
1932), and even the human (Kuhlen- 
beck & Miller, 1949). The majority of 
these reports disclose that the pretecto- 
tegmental tract terminates in the region 
of the red nucleus and substantia nigra. 
The second corticofugal pathway 
which functionally relates the occipital 
cortex with the brain stem is a direct 
one. This occipito-mesencephalic tract 
has recently been demonstrated in rats 
(Valverde, 1962) and cats (Pearce, 
1960). It very probably exists in 
monkeys as well (Jasper, Ajmone- 
Marsan, & Stoll, 1952; Mettler, 1935). 
In the rat, the projection is chiefly to 
the basal half of the midbrain tegmen- 
tum, while the projection in cats seems 
to be limited to the dorsal half (the 
subcollicular area). This corticofugal 
projection is homolateral, a finding to 
be elaborated upon later in the dis- 
cussion of the behavior of Split-brain 
animals. 
Several lines of evidence suggest 
that the direct occipito-mesencephalic 
tract functions Primarily in pattern-dis- 
crimination learning, while the Occipito- 
Pathway functions 
ghtness-discrimina- 
st of all, lesions of 


; Schwartz & Clark, 195 
(Myer, 1963; Smith, 1938). zii еме 
(Marquis, 1934) to perform success- 
fully on a pattern discrimination, Sec- 
ondly, pretectal damage has a signifi- 
cantly greater disruptive effect on a 
simple brightness discrimination than 
on a pattern discrimination (Thompson 
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& Rich, 1963). Finally, cats subjected 
to lesions in the subcollicular region of 
the midbrain to which the occipito- 
mesencephalic tract terminates have 
considerable difficulty in the perform- 
ance of visual pattern discriminations 
(Blake, 1959; Myers, 1964; Sprague, 
Chambers, & Stellar, 1961) but suc- 
cessfully retain a simple brightness 
discrimination (Blake, 1959). A 
Further evidence in support of this 
functional dichotomy is afforded by the 
following experiment recently per- 
formed in our Louisiana laboratory. 
Adult albino rats were trained pre- 
operatively on either a simple bright- 
ness (black card versus white card) 
discrimination or a pattern (vertically 
striped card versus horizontally striped 
card) discrimination. Since the appa- 
ratus, training, surgical, and histologi- 
cal procedures were quite comparable to 
those reported previously (Thompson 
& Rich, 1963), it is possible to dispense 
with a detailed description of the in- 
vestigation. Following learning, the 
majority of animals were subjected to 
bilateral electrolytic lesions of the 
caudal mesencephalon, In some cases, 
a special effort was made to damage 
the basolateral midbrain area (nucleus 
cuneiformis) to which the direct oc- 
“ipito-mesencephalic tract terminates 
(Valverde, 1962). After a recovery 
period ranging from 2 to 3 weeks, 
the control and operated rats were Xe 
quired to relearn the discrimination 
habit that was learned preoperatively- 
Individual retention scores were €X- 
Pressed in terms of percentage of er- 
ТОГ savings, and the significance © 
Sroup differences calculated by a non- 
Parametric test (Festinger, 1946). | 
Table 1 summarizes the original 
learning and retention scores for the 
various groups differentiated on the 
basis of the site of the lesion. A total 
of 16 rats initially trained on the pat- 
fern discrimination received bilatera 
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TABLE 1 


MEAN LEARNING AND RETENTION SCORES FOR ALL GROUPS 


Pattern discrimination 


| 
| Brightness discrimination 
| 


Group А " " 
Learning Retention Learning Retention 
N Errors savings N Errors savings 
(9%) (9%) 
Control 6 23.8 4 7.5 100 
Basolateral tegmentum 4 17.8 5 8.8 89.0 
Lateral tegmentum ó 20.2 
Subcollicular area E 14.3 
3 18.3 


Intercollicular area | 


^ Significantly different from the controls at the .01 level. 


midbrain lesions. The ventral portion 
of the nucleus cuneiformis along with 
a caudolateral segment of the sub- 
stantia nigra were damaged in four ani- 
mals. Two of these animals were un- 
able to relearn the task even after hav- 
ing received twice the number of trials 
required preoperatively. Тһе re- 
Maining two animals earned savings 
Scores of —10% and 50%. As a 
group, these four operated rats were 
inferior to the six controls at the .01 
level of significance. Figure 1 (Parts 
A, B, and C) illustrates the lesion of 
One rat which failed to relearn the 
Pattern discrimination. Those two 
animals which succeeded in reacquiring 
the habit suffered appreciably smaller 
lesions of the nucleus cuneiformis than 
those which failed to relearn. It is in- 
teresting to note that at the time of 
the postoperative test, these four ani- 
mals appeared quite normal in every 
Tespect. No obvious motor or sensory 
deficits were ever observed. The only 
disturbance appearing in these four 
animals was an aphagia which lasted 
for the first 3 postoperative days. 
An additional six animals sustained 
lesions of the lateral tegmentum which 
damaged, in part, the dorsal portion of 
the nucleus cunciformis. All earned 
Savings scores in excess of 83%. 
Finally, six rats subjected to extensive 


destruction of the subcollicular area or 
the intercollicular zone earned perfect 
retention scores. Figure 1 (Parts D, 
E, and F) summarizes the various sites 
of the lesions examined in this experi- 
ment. With respect to these three 
levels of the midbrain, only damage to 
the basolateral tegmentum yields sig- 
nificant effects on retention. 

That the ventral portion of the nu- 
cleus cuneiformis appears to function 
specifically for pattern-discrimination 
tasks is indicated by the performance 
of five rats trained preoperatively on 
the simple brightness discrimination. 
These five animals sustained lesions 
similar to those shown in the left col- 
umn of Figure 1. Three achieved 
perfect savings scores, while the re- 
maining two earned scores of 67% and 
78%. As a group, these five operated 
animals were not significantly inferior 
to the control group. It is noteworthy 
to report that the two animals which 
earned subperfect retention scores sus- 
tained midbrain lesions which were 
considerably larger than those suffered 
by the two rats which failed to relearn 
the pattern discrimination. 

Before leaving this section, three 
comments must be made in connection 
with these corticofugal pathways in- 
volved in visual discrimination learn- 
ing. The first comment is concerned 


Fic. 1. Photographs and camera drawings 
of frontal sections showing lesion placements, 


(Parts A, B, and C represent photographs 
of unstained sections showing a basolateral 
midbrain lesion which prevented relearning 
of a pattern discrimination in Rat 11M. 
Parts D, E, arid F Tepresent drawings of 
corresponding midbrain levels showing com- 
posites of lesions investigated in this experi- 
ment. Stippled areas, Parts E and F, indi- 
cate the rostral midbrain Projection of the 
occipito-mesencephalic tract. Areas enclosed 
by heavy solid lines denote positions of 
lesions preventing relearning. Areas en- 
closed by heavy interrupted lines denote posi- 
tions of lesions yielding no loss in retention. 
"The enclosed lesioned areas represent regions 
of total destruction only. Abbreviations : 


CG, central gray; IC, inferior colliculus ; 
IPN, interpeduncular 


nucleus; LM, medial 
lemniscus ; NP, pons; RN, red nucleus; SC, 
superior colliculus ; ПІ, nucleus of third 


nerve.) 


with the contribution 
occipito-fugal pathway in learning a 
difficult brightness discrimination. Tt 
now appears that the performance of 
a difficult (near-threshold) brightness 
habit may involve the utilization of the 
direct occipito-mesencephalic tract to 
a greater extent than the indirect path- 


of the direct 
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way. This conclusion is based A 
the findings that rats (Bauer & ecu 
1964 ; Lashley, 1930) and cats (Sii = 
1937) lacking the striate cortex od 
hibit increasing deficits in m » 
when the difference between the brig "4 
ness discriminanda is иа. 
Rats with pretectal lesions, in contra я 
exhibit no greater deterioration. i 
near-threshold brightness due 
tions as compared with — Г 
brightness discriminations fiue en 
& Rich, 1963). In fact, mti A 
lesions seem to produce more mar ue 
effects on a black-white isor puna en 
(Thompson & Massopust, 1960) Pen 
on a medium gray-white discrimi 
tion (Thompson & Rich, 1963). ith 
The second comment has to do m 
the dispensability of the occipito-p к 
tectal projection in the performance ell 
certain simple visual habits. It is p 
established that animals without i 
Striate cortex can learn a simple n 
ness discrimination as fast заз 193 4; 
mals (Lashley, 1929; Marquis, bla 
Smith, 1937), Similarly, total ? not 
tion of the occipital cortex may E 
Seriously affect the performance К, 
either а discrimination habit 
upon differences іп luminous id? 
(Bauer & Cooper, 1964) or an | by 
ance-conditioned response pos E 
a visual signal (Marquis & EDI 
1936; Meyer, 1963; Thompson, ssive 
Wing & Smith, 1942), Since ma: ees 
pretectal damage may destroy es imas 
ability to develop simple brig 19 ; 
discriminations (Layman, айй 
Thompson & Massopust, 1960) erm 
avoidance responses to visual s vide 
(Thompson, 1963b), the pretecto ridi 
mental tract may, under certain С епк 
tions, mediate visual habits ec 
of the corticofugal ee ol- 
Possibility is strengthened by s eod 
Servation that striate rats, upon ation; 
ing a simple brightness dieit 
exhibit a significant retention 
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following bilateral damage to the pre- 
tectum (Thompson & Rich, 1963). In 
this case, the visual input to the pre- 
tectum may be transmitted either by 
direct optic fibers terminating within 
the Pretectal area or by indirect asso- 
ciation fibers originating in the lateral 
geniculate or tectal nuclei (Meikle & 
Sprague, 1964). 

The final comment deals with the 
pee mechanisms underlying the 
unctional dichotomy of the occipito- 
fugal systems. According to the recent 
ae of Hubel and Wiesel (1962), 
E of the cells composing the striate 

ex are maximally responsive to 
panegi of light stimuli (lines and 
T and minimally responsive to 
поа illumination. It is only a 
са нро нена to the assumption 
stis pe cells responsive to patterned 
АЫ i excite almost exclusively the di- 
this ссоыееечеркерНаПе tract. Thus, 
[e bee may mediate form-discrimina- 
a bits by virtue of its inherent con- 
р, with particular occipital cells. 
= е changes in general illumination 
ed relatively ineffective іп evoking 
omen responses, these stimuli are as- 
Mem ed to activate primarily the pre- 
ce ae tract. The afferent 
he ү this pathway may very prob- 
fight »e the same as that involved in the 

a reflex ( Ranson & Magoun, 1933). 
CA pretecto-tegmental tract, there- 
Wa акр mediate conditioned responses 
of ‘lie by changes in the general level 
ү umination. With respect to visual 
ав. discrimination, no clear-cut 
Clectrophysiological data are available 
O suggest possible correlates. How- 
ever, it is not inconceivable that the 
Organization of the visual cortex is 
Such that the occipito-pretecto-tegmen- 
tal pathway is activated by striate cells 
Sensitive to gross flux differences, and 
the occipito-mesencephalic tract is ex- 
Cited by striate cells sensitive to slight 

ux differences. As a consequence, 
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suprathreshold and — near-threshold 
brightness-discrimination habits would 
be subserved by different occipito-fugal 
pathways. 

Let us now turn to some data which 
are customarily considered to be at 
variance with a centrencephalic theory. 


INTERHEMISPHERIC TRANSFER 
EXPERIMENTS 


A normal cat trained monocularly on 
a visual pattern-discrimination prob- 
lem will exhibit excellent transfer of 
learning when the untrained eye is 
utilized and the trained eye is covered. 
This transfer is also displayed in cats 
in which the crossed optic fibers have 
been cut (Myers, 1955). If, however, 
the corpus callosum and the crossed op- 
tic fibers are ‘severed, no transfer is 
observed (Myers, 1961; Sperry, 
Stamm, & Miner, 1956). 

These data seemingly detract from 
Penfield’s theory in two respects. 
First, Penfield assumes that the activi- 
ties of the cerebral hemispheres are 
coordinated and integrated through 
symmetrical ascending connections of 
the brain stem. This coordination and 
integration is conceived to be inde- 
pendent of the corpus callosum. Nu- 
merous experiments have recently been 
reported which attest to the coordi- 
nating function of the callosal system 
(Myers, 1961; Sperry, 1961). Sec: 
ondly, if one assumes correctly that 
Penfield subscribes to a subcortical 
memory trace, then Penfield’s cen- 
trencephalic theory demands that the 
memory trace should be bilaterally rep- 
resented within the brain stem. This 
follows from the assumption that each 
hemisphere has bilateral descending 
connections with the brain stem. 
However, the absence of interocular 
transfer of pattern-discrimination 
learning in split-brain animals makes 
it clear that a bilateral subcortical en- 
gram is not formed. 
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Concerning the first point, Penfield 
must accept the fact that callosal fibers 
do function under certain conditions to 
interrelate the activities of the cere- 
bral hemispheres. But this fact does 
not in any way speak against further 
coordinating and integrating functions 
of the brain stem. It has been shown, 
for example, that callosal fibers do 
not function critically in connection 
with interocular transfer of a simple 
brightness-discrimination habit (Mei- 
kle, 1960; Meikle & Sechzer, 1960). 
Additional studies demonstrate that 
the callosal system does not mediate 
learned responses in which the visual 
input is confined to one hemisphere, 
while the limb performing the re- 
sponses is governed by the contralat- 
eral hemisphere (Voneida, 1963). It 
has also been reported that the split- 
brain monkey can perform a visuo- 
tactile task in which the tactile stimu- 
lus is restricted to one hem 
while the visual stimulus is restricted 
to the opposite hemisphere (Sperry, 

1961). Furthermore, Doty (1961) 
has summarized several experiments 
which show that the callosal pathway 
15 unnecessary in the development of 
conditioned responses when the CS 
and UCS are applied to opposite hemi- 

spheres. The integrating circuits 


carrying out the foregoing activities 
must be subcortical į 


1 : rtica Thus, 
interhemispheric integration through 
the corpus callos: 


| s um weakens a cen. 
trencephalic position only to the ex- 


isphere 


n origin? 


: Interestingly, 
that the pretecto- 
"ur tract can be traced to the level 
of the pons wherein lies tł 

5 H ne 
cuneiformis. а 
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tent that the brain stem does not pro- 
vide all of the necessary circuitry to 
coordinate the activities of the two 
hemispheres. г 
Concerning the second point, while 
the work of Myers, Sperry, and others 
strongly suggests that the memory 
trace for visual pattern discrimination 
develops unilaterally in the split-brain 
animal, considerably more data are 
needed to establish that the memory 
trace for visual habits is formed within 
the cerebral cortex rather than within 
the brain stem. To cite some negative 
evidence, the memory trace for an 
avoidance-conditioned response to 2 
visual signal does not reside within the 
occipital cortex (Marquis & Hilgard, 
1936; Thompson, 1960; Wing & 
Smith, 1942). Schwartz and Clark 
(1957) have found that the memory 
trace for a discrimination between ? 
flickering light and a steady light i5 
not formed within the visual cortex: 
Recently, Bauer and Cooper (1964) 
reported that a discriminative response 
based upon luminous flux survives 
complete bilateral removal of the p05- 
terior half of the cerebrum. Final 2 
Cats (Hernández-Peón & Brust-Car- 
mona, 1961; Meyer, 1963) and тн 
(Thompson, 1959) deprived of mos 
of their neocortex learn certain tasks 
as fast as normals. The claim d 
the engram for simple habits 15 E 
calized at subcortical levels while = 
cortical gray mantle retains the e 1 
gram for complex habits is also i 
need of experimental verification. — 
In the succeeding account, an e 
pretation of the split-brain data E 
be presented within the framework r- 
а centrencephalic theory. This mor 
pretation is based upon the inton » 
tion concerning the functional chat?” 
teristics of the two occipito-fuE?, 
Pathways discussed previously p 
upon an assumption dealing with nm. 
Specific subcortical site of the engra 
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AN INTERPRETATION 


It will be recalled that the direct 

Occipito-mesencephalic tract is homo- 
lateral (Pearce, 1960; Valverde, 
1962). That is to say, the left occipi- 
tal cortex projects to the left mesen- 
cephalon, and vice versa. At this 
point, if it is assumed that the engranm 
forms at the terminal endings of the 
corticofugal pathway subserving the 
learned habit, then we can explain in 
terms of a centrencephalic theory the 
Presence or absence of transfer of cer- 
tain visual habits in the split-brain 
animal. Consider, first of all, the split- 
brain cat that is initially trained on a 
pattern discrimination with the left 
€ye only. Performance of this habit 
would be mediated by the left occipito- 
mesencephalic tract, and the memory 
trace would be formed within the left 
mesencephalon. When the right eye 
is exposed to the problem and the left 
eye covered, only the right occipito- 
mesencephalic tract is activated. No 
transfer of learning would be pre- 
dicted, The only way for transfer to 
take place is by channeling the visual 
input to the left occipito-mesencephalic 
tract. This can only be effected if the 
"mi optic fibers or if the callosal 
fibers are intact. Figure 2 (Part A) 
illustrates this phenomenon. 
v: is readily apparent that this 
а provides for the possibility 
nat split-brain animals can acquire 
antagonistic visual habits with the two 
eyes (Myers, 1956). The left occipito- 
mesencephalic tract could be utilized 
to mediate one habit with the left eye, 
while the right occipito-mesencephalic 
tract could be utilized to mediate an 
altogether different habit with the 
Tight eye. 

It has been reported that пеаг- 
threshold brightness discriminations 
will not transfer in split-brain animals 
(Meikle, 1960). As pointed out ear- 


Fic. 2. A semidiagrammatic representation 
of the split-brain animal showing the 
critical neuronal pathways conveying optic 
impulses to the brain stem. (The shaded 
pathways indicate the route over which 
visual impulses travel when the leít eye is 
exposed. See text for a further description. 
Part A: Neuronal pathways subserving pat- 
tern- and near-threshold brightness-discrimi- 
nation habits. Part B: Neuronal pathways 
subserving a simple brightness-discrimination 
habit. Abbreviations: CC, corpus callosum ; 
gl, lateral geniculate nucleus; MES, 
mesencephalon; OC, optic chiasma ; BG, 
posterior commissure; PRET. pretectal 
area; V. CORT., occipital cortex.) 


lier, а near-threshold brightness-dis- 
crimination habit is very probably 
mediated by the same direct occipito- 
fugal pathway that is utilized in pat- 
tern-discrimination habits. Thus, no 
transfer would be predicted. 

Let us now consider the simple 
brightness-discrimination habit which 
does transfer in the split-brain animal 
( Meikle, 1960; Meikle & Sechzer, 
1960). It will be remembered that a 
simple brightness discrimination is 
probably mediated by the occipito- 
pretecto-tegmental pathway. The en- 
gram, in this case, would develop at 
the terminal endings of the pretecto- 
tegmental tract (the region of the red 
nucleus and substantia nigra). While 
this projection is homolateral (Krieg, 
1947; Papez & Freeman, 1930), the 
left pretectum communicates with the 
right pretectum through the posterior 
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and tectal commissures (Bucher & 
Bürgi, 1952; Gurdjian, 1927 ; Kuhlen- 
beck & Miller, 1942). Thus, a split- 
brain animal trained on a supra-thresh- 
old brightness-discrimination habit with 
the left eye will utilize the left oc- 
cipito-pretecto-tegmental pathway and, 
by way of the subcortical commissures, 
the right pretecto-tegmental tract. 
When the right eye is now exposed to 
the problem, the right occipito-pretecta] 
tract is activated which, in turn, ex- 
cites both the left and right pretecto- 
tegmental tracts. Transfer of simple 
brightness discriminations would there- 
fore be predicted. Figure 2 (Part B) 
illustrates this phenomenon, 

It would follow that Sectioning of 
the posterior and tectal commissures 
as well as the crossed Optic and cal- 
losal fibers should Prevent transfer of 
simple brightness habits, Partial con- 
firmation of this Prediction comes from 
the work of Voneida (1963) who 
failed to obtain transfer o 


OTHER CONSIDERATIONS 


The basic assumptions of the fore- 
going account a 


to, the ventral 
A similar direct pro- 


- By 
making the claim that this pathway 
mediates Somesthetic discriminations 


plain the absence of t 


е г Tansfer of so- 
mesthetic discrimination 


habits in cal- 
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losum-sectioned | animals (Ebner & 
Myers, 1962; Stamm & Sperry, 1957). 
In a similar fashion, it is possible to 
explain the lateralization of the mem- 
ory trace in animals required to learn 
à simple conditioned response dur- 
ing unilateral Spreading depression of 
one hemisphere (Виге5 & BureSova, 


1960). 


It is noteworthy to mention the 
ease with which these assumptions 
Can be tested. In some respects, 
the nervous Structures and circuits 
Proposed to underlie visually guided 
behavior have been defined even more 
Precisely than originally intended. The 
luster of this Precision, however, is 
marred to the extent that the temporal 
cortex has not been fitted into this 
anatomical scheme. The specific role 
played by the temporal lobes in visual 
iscrimination Performance must await 
further elucidation of the functional 
pathways Proceeding to and from this 
Section of the Cerebral cortex. The re- 
Search plan followed by Chow (1961) 
may be fruitful in this regard. In any 
event, the functional significance of 
the temporal cortex would be pre- 
dicted to emerge from anatomical re- 
lationships either with the occipital 
Cortex, Pretectum, or brain stem. s 

Finally, a remark must be made in 
connection with possible phylogenetic 
differences in functional pathways me- 
diating discriminative responses. 
Throughout this paper, it has been 
assumed that the anatomical mecha- 
nisms underlying the performance of 
discrimination tasks are essentially the 
same in all mammalian forms. This 
assumption is basic to a centrencephalic 
theory which views the phylogenetic 
new brain (neocortex) as being an e 
tension of the phylogenetic old brain 
brain Stem). Despite the vast differ- 
ences in anatomical complexity, the 
functional interrelationships between 
the telencephalon and the dimesen- 
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cephalic zone are conceived to be pre- 
served throughout the mammalian 
Series. The anatomical review of the 
Corticofugal pathways is not inconsis- 
tent with this assumption. Further- 
more, it may be more than coincidence 
that an investigation of the brain-be- 
havior relationships in the rodent has 
led to the same conclusions as those 
drawn by Penfield's analysis of the 
neural mechanisms underlying human 
behavior, й 
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PRACTICE AND THE SIMPLEX 
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McGill University 


Jones' (1962) application of the simplex to practice matrices is exam- 
ined. It is argued that the simplex represents, not à process of simpli- 


fication as suggested by Jones, 


successive sets of scores, an idea whic 
that practice is a stochastic process. 


but a random-walk relation between 


h is compatible with the view 
]t is suggested that it is more 


meaningful to suppose that practice results in changing factor scores 
of individuals rather than in changing factor loadings of the task, as 
has more generally been assumed. Some methodological implications 


are discussed. 


i A number of authors have applied 
factor analysis to matrices which include 
intercorrelations between practice trials 
ze some _task (e.g., Fleishman, 1953, 
eH Fleishman & Hempel, 1954, 1955; 
dep 1952). The matrix of such 
fa ercorrelations exhibits a characteristic 

rm; the correlations between adjacent 
trials are highest, and they become pro- 
КАА smaller the further apart the 
наа аге in the sequence. Such a matrix 
has a rank of at least two, so that factor 
Pie ing yields more than one common 
actor. This has generally been inter- 
Preted as evidence that the factorial 
Structure of the task changes with prac- 
tice. One of the points made in this 
Paper is that this conclusion is not 
always warranted. It will be suggested, 
"s fact, that it may be more meaningful 
© suppose that the individuals change, 
not the task. 

TThe present argument derives from 
the observation made by Jones (1962) 
that every practice matrix so far pub- 
lished can be fitted to the simplex, а 
matrix originally developed by Guttman 
(1954), The simplex may be defined by 


! The author wishes to thank D. Bindra, 
G. A. Ferguson, and R. Traub of McGill 
University and H. Sampson of the Univer- 
Sity of Auckland, New Zealand, for their 
Critical reading of various drafts of this paper. 

e views expressed are, however, entirely 
the responsibility of the author. 


the following equation for “perfect 
simplicial form" (Jones, 1962) : 


fik = Рај Vik u] 


for all i<j <k, where it is assumed 
that the variables are ordered on somc 
criterion. The consequences of Jones’ 
observation will be examined closely in 


the present paper. 


Tue SIMPLEX AS SIMPLIFICATION 


Jones postulates that the following 
factor model underlies the practice 


simplex: 


1 =fitfetfat- + fn 
feat fate + fn 


хз = 
хз = В+ th [2] 
ха = fn 


where all variables are defined in devia- 


tion form only? 

According to Jones, this model de- 
fines a process of increasing simplifica- 
tion, an idea underlying Guttman $ 


з Еог convenience, the unique factors 5i 
postulated by Jones are omitted here. It is 
assumed therefore that we are dealing with 
the simplex in unattenuated form. Jones 
(1962) has described a technique for deriving 
the unattenuated simplex from its attenuated 


form. 
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original choice of the term "simplex" 
to describe the matrix. Jones argues 
that this interpretation is preferable in 
this case to the usual factorial interpreta- 
tion because it enables description in 
terms of a single dimension (simplicity). 
whereas routine factor analysis would 
require several dimensions. He de- 
scribes this approach as “molar,” in 
contrast to the “molecular” approach 
of conventional factor analysis. 

But there are difficulties with Jones’ 
approach. In the first place, the same 
correlation matrix may be obtained from 
the converse equations describing de- 
creasing simplification : 


mM = fi 
Xa = fit f: 
ws = fit fet fa [3] 


Xs = fi fa + fs +f, 


Jones’ choice of Equations 2 rather than 
Equations 3 to describe practice is based 
on somewhat imprecise psychological 
considerations; there is nothing in the 


correlational data themselves to indicate 
which is to be preferred. 


Moreover, Jonesa 
his defense of E 
sumption that E 
sent the only pos: 


ppears to have based 
quations 2 on the as. 
quations 2 and 3 repre- 
s sible alternatives. This 
is not the case. We may suppose, for 
example, that increasing simplification 
occurs up to a point, when decreasing 


simplification begins, as in the following 
equations: 


= fit fet fat f, 


ае fh 
ii E Sat fe 
CAN fa 4] 
Wise fs + fi 


= Fs + 5+} 
rc f F fe teti 


Clearly, we may go even further and 
consider any successive combinations of 
increasing and decreasing simplification 
Indeed, we may rotate axes so as fo 
render meaningless the notion of sim. 
plification, either increasing or decreas- 


ing, and still retain a factorial repre- 
sentation which generates a simplex. 
The concept of simplification, in 
Jones' account, also appears to depend 
on the proposition that the f; of Equa- 
tions 2 are themselves meaningful fac- 
tors. It is therefore a further weakness 
of his position that he was unable to 
identify the fi he obtained from Fleish- 
man's (1960) data. This might suggest 
in fact that some rotation of axes would 
lead to more meaningful interpretation— 
and destroy the concept of simplification. 
However, it might be argued that Jones’ 
attempt to identify the f; was ill advised 
and that Guttman (1954) in his develop- 
ment of simplex theory had in mind a 
dimension whose main justification was 
Precisely that it did not require the fi 
to be identified. Perhaps so, but it 
appears that Guttman nonetheless based 
his choice of the term "simplex" on а 
model such as is represented by Equa- 
tions 2, which implicitly demand that 
the f; have some meaning. In a theo- 
retical sense, if not in a predictive 
sense, the dimension envisaged by Gutt- 
man remains obscure. 
. The above criticisms apply to the 
interpretation of the simplex offered by 
Jones. It is not disputed that the sim- 
plex may provide a close fit to the 
practice matrix, in fact Jones' demonstra- 
tion that this may be universally so is 0 
great interest. [t appears, however: 
that a more general interpretation O 
the simplex is required before the full 


meaning of the demonstration can be 
evaluated. 


THE SIMPLEX as А Ranpom WALK 


Equations 2, 3, and 4 have an im- 
portant characteristic in common: Each 
variable differs from the preceding one 
by just one factor, or vector (perhaps а 
better term tham "factor" in this case 
since the factor content of these vectors 
15 not clear). Moreover the vectors by 
which each pair of adjacent variables 
differ are mutually uncorrelated. This 
relation may be expressed as follows: 


[5a] 


Mi = x; + faa 
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b ith a little algebraic manipulation, 
his equation may be written in standard 
score form as follows: 


Sip = кануа Уле ну Фф 


[5b] 


Where rii) ds the correlation coeffi- 
cient between i'^ and (i + 1)'* variables, 
and фы is the standard score equiva- 
lent of fiii. 

Assuming only that the relation. be- 
tween scores is linear, Equation 5b 
provides a general model for the simplex 
15 some unspecified factor space. Geo- 
metrically, the entire structure may be 
generated as follows. The first variable 
кыин е a unit vector in a 
| р rspace. The second variable is 
ocated by rotating the first through 
5 angle of cos7!riz. The third is 
-ocated by rotating the second in a plane 
: right angles to the plane formed by 
р. first two variables, through an angle 
gue rs. In general, the (i + 1)* 
V enu may be located by rotating the 
dul с in a plane at right angles to 
abi yperplane formed by the first 7 vari- 
Т} е, through an angle of cos ri, (i4-0- 
nr. = result is an n-dimensional struc- 
б t could have been generated by 

ginning with the final variable and 
Working backwards or by beginning with 
any intermediate variable and working 
outwards. 

The reader may gain a better intuitive 
a of the simplex structure by con- 
add . the following “random-table 
reel in preference to the geometric 
SUR К Any column іп a table of random 
the ы тау be chosen to represent 
io rn variable. The second variable 
cain e obtained by adding the next 

mn to the first. Succeeding variables 
may be generated by adding a new 
Column each time. Columns may be 
subtracted instead of added at any 
Stage. Provided that the number of 
Scores in each column is large enough, 
the matrix of intercorrelations between 
the variables should give a good approxi- 
mation to the simplex. ВУ weighting 
each column before addition (or subtrac- 
tion) with some appropriate scalar 
multiplier, it should be possible to 
Simulate алу given simplex. 
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This structure may be described as a 
random walk (whether through factorial 
hyperspace or through random tables), 
the word "random" being used in the 
sense that every step (ф;) is uncorrelated 
with every other. This random-walk 
relation. exists regardless of how the 
reference factors are defined or placed. 
In this respect its principal characteristic 
is molar rather than molecular, in the 
sense that Jones has emphasized. Моге- 
over, it exists for every solution of the 
simplex which conforms to the basic 
linear factor model, since the structure is 
unaffected. by any rotations of axes. 
Hence it is not necessary to choose be- 
tween Equations 2, 3, or 4 or any other 
possible variants of these equations; all 
of these sets of equations may be thought 
of as describing a random walk. 

A factorial interpretation of a random 
walk must be virtually impossible, par- 
ticularly in the present context where 
each variable is in fact a practice trial 
on some task. The vectors oi though 
random at least in the sense of being 
mutually uncorrelated, cannot be dis- 
missed as representing error variance, 
since we are considering the simplex in 
unattenuated form. To assign factorial 
meaning to these vectors would require 
at least as many factors as there are 
variables? When each variable is a prac- 
tice trial, moreover, it should be possible 
theoretically to obtain any number of 
factors, however large, by simply in- 
creasing the number of practice trials 
to the required amount. No conven- 
tional factorial interpretation can make 
sense of these properties. 

A molar view of the random-walk 
representation of practice permits the 
somewhat trite observation that the 
trials are ordered іп sequence; this 
sequence, of course, reflects exactly the 
temporal sequence in which the trials 
were performed. Kaiser (1962) has 
demonstrated that the simplex can be 


half this number are re- 
i i ini k model is chosen 

quired if the minimum ran | 

to describe the simplex (DuBois, 1960). 

Humphreys (1960) has already argued, how- 

ever, that no sense can be made of the simplex 

if the minimum rank model is chosen. 


з Actually only 
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interpreted as a ratio scale, but if we 
discard the notion of simplicity (or 
complexity), the problem is to decide 
what this scale measures. 

In general, then, neither the molecular 
approach of factor analysis nor the more 
molar approach suggested by Jones pro- 
vides a satisfactory theoretical interpre- 
tation of the practice simplex. Rather 
than abandon the simplex altogether, 
however, the following sections will 
explore the possibility of interpretation 
in terms of a linear model which will be 
presented as an alternative to the usual 
factor model. 


ALTERNATIVES TO THE Factor MODEL 


For N individuals, л tests, and ғ factors 
the conventional linear factor model 
may be written in matrix form as follows: 


Z= MF [6a] 
where Z is the n X N matrix of test 
scores, M is the л X r matrix of factor 
loadings, and F is the r x N matrix 
of factor scores! It is not crucial to the 
remainder of this section to specify 
whether this equation shall represent 
the total factor space or the common 
factor portion only, 


Equation 6a may be rewritten thus: 


zi mF 
ЕД mF 
ж = [6b] 
Zi mF 
Zn M,F 


where each z; isa 1 X N vector con- 
taining the scores of the ү individuals 
on Test 1, and each m; is al X r vector 
containing the factor loadings of Test i 
on the r factors, 


Equation 6b makes 


assumption of the facto, 
matrix of factor score 


plain an implicit 
r model: that the 
S, F, remains the 

‘Note that, followin 
standard text, the symb 
here to represent the 
F is also widely used to 
of factor loadings, 


g Harman's (1960) 
ol F has been chosen 
factor score matrix, 
represent the matrix 
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same for each test. That is, it is as- 
sumed that each individual retains the 
same score on each factor throughout 
administration of the test battery. In 
the majority of factor studies there may 
be no reason to doubt this assumption. 
However, there are certain situations in 
which its validity must be open to 
serious question, specifically when there 
is the possibility of transfer, learning, 
or maturational effects occurring be- 
tween tests. In such circumstances the 
following more general equation may 
be more appropriate : 


zi mifi 
5% жт»Ё» 
= и [7] 
Zi mF; 
Sn MaF 


where F; is the matrix of factor gn 
at the time when Test i is administered, 
and the F; are not necessarily the same 
for different values of i, К 

Although a model of this kind may 
eventually be necessary for a satis- 
factory factorial analysis of tests ate 
volving learning, transfer, or matura 
tional changes within the test battery: 
it is proposed here to consider a specia 
case only. In the case of practice e 
learning, when the tests are in fact 
Successive trials on the same test ОГ 
task, it seems a reasonable assumptlo” 
to make that the factor loadings ü 
this task do mot change from trial à 
trial; that is, let us assume that m; = 


5 1 y be 
say, for all i, Equation 7 may now 
written: 

ET mFi 

23 mF: 

x. [81 

Zi mF; 

Zn mF, 

I int 
It might be objected at this poU 


to 
that Equations 7 and 8 tun aun 
the principal of parsimony, since d can 
are hypothetical constructs an 
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always be defined so as to conform to 
Equations 6a and 6b; that is, one cer- 
tainly can factor analyze the simplex 
in terms of the usual factor model and 
obtain factors. Equations 7 and 8 may 
therefore seem superfluous. However, 
as has already been argued, factors ob- 
tained from the simplex and based on 
ше usual factor model as represented 
by Equations 6a and 6b are uninterpret- 
“г „Further, it will be argued in the 
ollowing section that much better sense 
can be made of the practice simplex, 
x least, if the model represented by 
ee 8 is chosen in preference to 
ре by 6b, although ulti- 
Ба у а more complex model such as is 
represented by 7 may give the more 
penetrating solution. 


INTERPRETATION OF THE PRACTICE 
SIMPLEX 


a us suppose now that the random- 
i relation inferred above exists, not 
i сз „the factor loadings from trial 
6) px (i.e., between the әл; of Equation 
ped but between the factor scores (i.e., 
k3 Ee the F; of Equation 8). That 
mr assume that the factor scores for 
h individual are subject to small 
Ай changes from trial to trial. 
EDAM it might be expected of course 
that scores on a practice task would 
vc from trial to trial. Since 
Fee ee data only convey informa- 
Son about scores relative to their mean 
с the present approach is not 
nsistent with the notion that mean 
cane improve from trial to trial. We 
only suppose that changes are 
random about some mean change; again 
random" in the sense that every set 
of score changes is uncorrelated with 
every other set. This state of affairs 
may be represented by the following 
equation: 


Fig = аца: + V1 ан Саа [91 


where Са is ther X N matrix of random 
elements (change scores), and агы is 
some scalar multiplier. 

It can be shown t 
generates a simplex as f 


hat Equation 9 
ollows: First, 
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the correlation between any two trials, 
i and j, is given by: 


rj = Nai) [101 
where z;' is the transpose of 5. From 
Equation 8 we have z; = mF; and 


z; = mF;, so that Equation 10 becomes: 
[11] 


Now consider any three trials, i, j, and k, 
such that i <j < Ё. We may consider 
these three trials to be consecutive trials 
by disregarding those intervening be- 
tween them, and from Equation 9 we 
may write the following equations con- 
necting Fi, Fj, and Fx: 


ту = 1/NmF;Fj'm' 


^ za + vi — aj: Cj [12] 
Fe = а + V1 — aj? Cr 


where 


F;C/ = “С = F;Cx’ = C;C = 0 
(the null matrix) [13] 


It follows from Equations 11, 12, and 13 
that the correlations between trials 7, j, 
and k can be written: 


f aj/ N- mF;Fém 
Yik ajar/ N: тЕ:Е т! 
тк = а №: mF Fim 


[14] 


These correlations satisfy Equation 1 
which defines the simplex. 

Several arguments may be advanced 
in favor of this interpretation of the 
practice simplex rather than that based 
on the usual factor model: 

1. The apparently jnsurmountable dif- 
ficulties of interpreting a random-walk 
relation between factor loadings are 
overcome since i in the 
present interpret f 
content remains constant from trial to 
trial. The individuals, 
change; а possibility which Humphreys 
(1960) has recognized. This notion has 
the further advantage that a test may 
still be “cataloged” with its factor con- 
tent specified regardless of the state of 
practice among the individuals, whereas 
in the approach exemplified by Fleish- 
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man's work it would be necessary to 
qualify the factor content depending on 
how practiced the individuals were. 

2. The notion of a random-walk rela- 
tion between scores is not a new concept in 
learning. For example, Estes (1959) 
has proposed a random-walk model for 
choice behavior. Generally, the present 
model accords well with stochastic 
theories of learning and may even pro- 
vide a useful technique for the confirma- 
tion and extension of such theories. 

3. Several workers have failed to 
identify any kind of learning factor based 
on correlations between score changes 
and other criterion variables (Simrall, 
1947; Woodrow, 1946). This is con- 
sistent with the present view that score 
changes in learning are random, 

4. The present interpretation would 
imply that any factors derived in the 
usual way by factor analyzing a simplex 
would be artifacts, Such “artifactors” 
are generally easily recognizable. Typi- 
cally, they show Systematic changes in 
loadings from trial to trial, 
from a maximum at some point in the 
sequence of trials, f two artifactors 
are extracted, as is commo 
they are generally 
metrical positions suc 
maximum loading tow 
the sequence, 
towards the opposite end. 


tapering 


^ Because of 
the simplex "border effect" (Guttman 
1954), however, there is a drop in 
loadings for each end trial. Finally, as 


might be expected, artifactors 


е аге репег- 
ally Uninterpretable in terms of any 
reference battery of tests. All of the 


data. 
The above arguments are Consistent, 
then, first with the view that score 


changes in practice are random (about 
some mean change) and second with 
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the view that these changes represent 
changes in factor scores, not factor 
loadings. It is possible, however, that 
the case has been overstated. At least 
some of Fleishman's data give fairly 
convincing evidence for genuine change 
in factor loadings with practice (e.g. 
Fleishman & Rich, 1963). Further, 
Manning and DuBois (1962), using , 
more sophisticated techniques than ear- 
lier workers such as Woodrow (1946), 
who were not able to do so, have reported 
evidence for a learning factor. Their 
finding suggests that score changes with 
learning or practice may not be alto- 
gether random; the walk may not always 
be as random as the rigorously defined 
simplex implies. We might speculate, 
perhaps, that practice is a stochastic 
process from the point of view of cach 
individual's performance but that there 
may exist consistent differences between 
individuals in the parameters of this 
stochastic process. The simplex com- 
ponent of practice matrices might there- 
fore reflect the stochastic properties of 
Change within individuals, while what 
evidence there is for genuine change in 
factor content with practice might reflect 
parametric differences between indi- 
viduals. Such differences might be ex- 
pected to be difficult to isolate since they 
are confounded both by the stochastic 
properties of change within individuals 
and by measurement error. 


SOME METHODOLOGICAL 
IMPLICATIONS 


The issues discussed in this paper raise 
a number of methodological points: 

1. The most important is that the 
practice matrix should not be factor 
analyzed or included in any matrix to 
be factor analyzed since it is likely that 
artifactors resulting from changes !" 
factor scores rather than changes 1" 
factor loadings will appear. Determina- 
tion of factorial change with praetor 
Should be based on correlations of tria 
Scores with criterion factors or tests 
defined independently of the practice 
Matrix itself. This requirement is satis 
fied in the study of two-hand coordina- 
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tion by Fleishman and Rich (1963) but 
not in Fleishman's (1960) study of 
rotary pursuit learning. ` 

‚ 2. It is also important that determina- 
tion of factor content at any stage in 
learning or practice be evaluated by 
reference to criterion tests administered 
at that stage. For example the random- 
walk interpretation of the practice sim- 
plex presented above would lead us to 
Predict that correlations of trial scores 
With some criterion would in any case 
decrease with practice provided that the 
Criterion task is administered before 
Practice begins. This follows not be- 
[See ae practice task is becoming less 
i T ше criterion task but because the 
ndividuals themselves are becoming less 
like what they were when the criterion 
task was administered. Conversely, if 
the Criterion task is administered after 
Practice, we might predict increasing cor- 
relations. Fleishman and Rich (1963) 
do not state explicitly when in relation 
to practice their two criterion tests were 
administered, but if we assume them to 
have been administered before practice 
began, then the finding of increasing 
Correlations between “kinesthetic sensi- 
hold and the practice task is more 
Convincing demonstration of genuine 
factorial change associated with the test, 
Not the individuals, than is the finding 
of decreasing correlations between “aerial 
Orientation” and the practice task. 

_ 3. The confounding influence of chang- 
ing factor scores might best be elimi- 
nated either by “removing” the simplex 
component using a technique such as 
Kaiser (1962) has suggested, or by carry- 
Ing out separate factor analyses before 
and after practice. In the latter alterna- 
tive the test batteries should include, on 
the one hand, the first trial and criterion 
tests administered before practice and, 
on the other hand, the final trial and the 


5 Surprisingly, there seems to be a lack of 
evidence in the literature as to whether these 
Predictions hold in fact. Authors either do 
Not state when in relation to practice the 
criterion tasks were administered, or they 
confuse the issue by presenting the criterion 
tasks before practice to half the subjects and 
after practice to the other half. 
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same (or parallel) criterion tests ad- 
ministered after practice. Actually an- 
other issue, besides that involving chang- 
ing factor loadings versus changing 
factor scores, is involved here. It 
concerns the nature of transfer. Sup- 
posing that scores on some factor repre- 
sented in the practice task do change 
with practice, does it follow that the 
same changes will occur on this factor 
as it may be represented in other tasks? 
If the answer is yes, we would expect, 
on the assumption of changing factor 
scores but invariant factor loadings with 
practice, that factor analyses before and 
after practice would vield essentially 
the same results (Humphreys, 1960). 
If changes in factor scores on the prac- 
tice task are not in general accompanied 
by changes in the same factor scores as 
measured by other tests (a possibility 
which would raise some new difficulties 
for factor theory), then we might expect 
that loadings of factors on the practice 
task should be smaller after practice 
than before but that the general pattern 
of loadings should remain the same. 
Finally, genuine change in factor content 
of the task should result in altered pat- 
tern of factor loadings after practice. 
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The range-frequency theory is concerned with ca 
"large" “medium,” 


dgments from 2 basic assumptions: (a) 


like "good" and “bad,” or 
specific model derives the ju 


The judge divides his psychological range into subranges w 
the stimulus conditions; and (b) he employs 


ith equal frequency. The model uses judg- 


sizes are independent of 
the alternative categories w 


ments obtained when stimuli are presented with equa 
when stimuli are presented w 
Iso used to evaluate the weighted-mean 


It is concluded that category judgments 


predict the judgments 
frequencies. These data are a 
model for adaptation level. 


are more adequately explained by the range- 


by the theory of adaptation level. 


This paper presents a theory of 
Category judgment, tested in experi- 
ments with squares which vary in size. 
The experimental task requires the 
Subject to judge each square as it is 
Presented, using an ordered set of size 
Categories, from "very small" to "very 
large." 

In this task, as in everyday experi- 
ence, the subject must establish his 
Омид standards for using the categories. 

Large,” like “beautiful” or “worth- 
while,” relates what is being judged 
to some implicit frame of reference. 
For the squares, the judgments adjust 
to the range of sizes in the experiment. 


After repeated presentations, “very 
large” is reserved for the largest, 
very small” for the smallest, with the 

1 This research was supported by National 
Science Foundation Grant GB-1768 and 
Public Health Service Grant HD-00923. 


tegory judgments, 
and "small" A 


hose relative 


1 frequency to 
vith unequal 


frequency theory than 


other categories indicating the order of 
the intermediate sizes. However, the 
subject does more than merely rank 
the squares: His judgments reflect his 
sensitivity to the degree of difference 
between successive sizes and to their 
relative frequencies. His frame of 
reference or context for judgment ap- 
pears to be the frequency distribution 
of stimulus values. 

It is useful to study simple per- 
ceptual judgments because they are so 
dependent upon the immediate stimulus 
context. Since the presentations are 
controlled by the experimenter, the 
procedure permits independent manip- 
ulation of important features of the 
context. In the present experiments, 
the stimulus frequencies are manipu- 
lated in order to test a theory of judg- 


ment. The objective is to account for 
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the changes in judgment of each of the 
squares. 


THe THEORY 


The range-frequency theory treats 
category judgment as a киран e 
between two principles: One principle 
is concerned with how the judge di- 
vides the stimulus range, the other 
with how frequently he uses different 
categories. 


The Range Principle 


The first principle asserts that the 
judge uses the categories to subdivide 
the psychological range, the difference 
between the two extreme values | of 
the stimuli that form the psychological 
context for judgment. A subrange cor- 
responds to each category. The rela- 
tive size of each subrange depends only 
upon the particular set of categories 
and is independent of stimulus condi- 
tions. 

As a simple example, suppose there 
were just two categories, “large” and 
“small,” which the subject used for 
stimuli in the upper and lower halves 
of his range, According to the range 
principle, a change in the stimulus 
frequencies would not affect his judg- 
ments of any of the stimuli unless it 
also changed his psychological Tange. 
Each category would always cor- 
respond to half the range 

Although de 


pendent upon the physi- 
cal range, the psychological range can- 


not be directly inferred from it. Thus, 
à subject may believe that the stimulus 
he is judging is the smallest that has 
been presented, even though it has been 
preceded by even smaller stimuli. His 


psychological range must be inferred 
from his judgments, 


The Frequency Principle 


The second principle asserts that the 
judge uses each category for a fixed 
proportion of his judgments. These 
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proportions may vary for different as 
of categories, but we can often y in 
equal proportions. This will be bo 
for the present tests of the theory. 
frequency proportions are fixed in ee 
sense that they are independent of t 
stimulus conditions, as with the relative 
sizes of the subranges. 2, h 
But unlike the range principle, se 
frequency principle implies that em 
judgments are affected by the stimu дв 
frequencies. Suppose that in the y 
ample in which "large" and г. 
corresponded to equal subranges, К ж 
larger stimuli were presented more 2 
quently ап the smaller m a 
“Large” would then be used more fre 
quently. To conform to the frauen 
principle, “large” would have to со 
spond to a narrower subrange. m 
case, the range and frequency princip. 
would conflict. | йеп 
The two principles conflict w is 
stimuli from different parts of E ^e 
range are presented with unequal e 
quencies. The theory postulates t = 
the actual judgments reflect a „Р 
mise between the two principles. is 
nature of the compromise will now - 
specified in a model which yields iiem 
titative predictions of the judgments. 


Tue Моркг, 


The range-frequency model er 
with category limens. A MOSR 
simply the boundary between two s as 
gories. In practice, it is define re- 
the stimulus value that, if it were [ше 
sented, would be judged half the (or 
With the higher of two categories this 
With still higher categories). In a 
model, stimuli and limens are es 
оп a Thurstone scale of equal ger- 
criminability (as described in paue 
son, 1958) rather than on a p re- 
Scale. Since the Thurstone pe 
lates the judgments to the emp 


s ict these 
limens, the model must predict 
limens, 


CATEGORY JUDGMENT: A MODEL 


The model can be briefly stated, 
using three definitions and a single law 
of judgment: 


D1. Empirical limens: the values of 
the boundaries between successive cate- 
gories on a stimulus scale of equal dis- 
criminability. 

D2. Frequency limens: the values the 
empirical linens would have if the cate- 
gories were used with equal frequency. 

D$. Range limens: the values the 
empirical limens would have if they di- 
vided the psychological range inde- 
pendently of the stimulus frequencies. 

J4. Each empirical limen is the mean 
of the corresponding range and fre- 
quency limens. 


This model simplifies the theory so 
that it can be tested experimentally. 
The compromise between the two prin- 
ciples of judgment places each em- 
Pirical limen halfway between a range 
and a frequency limen. Thus, each 
pair of empirical and frequency limens 
determines a range limen. The range 
limens can be calculated from the judg- 
ments obtained when stimuli are pre- 
sented with equal frequency. Since 
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the range limens are independent of 
the stimulus frequencies, they can be 
used to predict the empirical limens 
and judgments for stimuli presented 
with unequal frequencies. 


Calculation of the Limens 


This section describes how the dif- 
ferent limens are obtained and their 
relationships to the actual judgments. 
The examples should make the mean- 
ing of the limens more concrete. How- 
ever, this material is not essential to a 
general understanding of the range- 
frequency model, and the reader can 
follow the experimental tests and 
evaluation without working through 
the details of this section. 


Figure 1 illustrates how the empirical 
limens are located by plotting the propor- 
tions of the judgments that place each of 
the stimuli above each of the empirical 
limens. Thus, the largest square, Stimulus 
9, is shown as being placed in the topmost 
category (ie, above the limen separating 
Categories 5 and 6) for about 50% of its 
presentations, and in either of the top two 
categories for about 95% of its presenta- 
tions. 

Figure 2 schematizes the location of the 
frequency limens for the same Rectangular 


.99 


.90 


PROPORTION ABOVE LIMEN 


hth 


NORMAL DEVIATE 


JT 
i 


STIMULI 


Fig. 1. Empirical li 


limen for each stimu 


mens determined from, propo _judgm: 
lus in Rectangular distribution. 


[Le П 
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rtions of judgments greater than each 
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IMULUS : 1234 56 
pne BUD (5) (5) (5) (5) (5) (5) (5) (5) (5) 


ЖКХ 


LIMENS: FREQUENCY-- 1 1 
RANGE — | 1 1 1 1 


EMPIRICAL— 1 1 


Fic. 2. Distributions of hypothetical val- 
ues of frequency limens and their a 
ships to empirical and range limens for 
Rectangular distribution. 


distribution of stimulus presentations that 
yielded the judgments for Figure 1. Vari- 
ability of judgment is shown in Figure 2 
as a normal distribution of values for each 
of the five frequency limens. This. varia- 
bility is attributed to shifting locations of 
the limens on the psychological scale of size. 
Since the variability is in the relationship 
between the stimuli and the limens, it could 
be portrayed equally well using a distribu- 
tion of values for each of the nine stimuli 
or, following Thurstone, using distributions 
of values for the stimuli and also for the 
limens. 

The SD of each of the normal distribu- 
tions in Figure 2 is one stimulus step, the 
difference between successive stimulus val 
on the psychological scale of size. 
the approximate stimulus difference 
the .50 and .84 values for each of 
functions in Figure l, so that the 
Thurstone scaling of these data would yield 
stimulus values separated by unit variability. 
The range-frequency model could also be 
applied by using different SDs for the dif- 
ferent limens, by scaling the stimulus di- 
mension with one of the more complex 
Thurstone procedures, or by a procedure 
for equalizing the differences between suc- 
cessive range limens, 

The fre 
off 


ues 
This is 
between 
the five 
simplest 


con- 
fre- 


pre- 
y in the Rectangu- 
ighest category would 
be used for 1/9(1/6+1/2+ 5/6) =1/6 of 
the total number of stimulus presentations, 

Each of the five frequency limens, either 
for the Rectangular or for any other distri- 
bution, can be determined by successive ap- 
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proximation so as to mark off su us 
quencies of category use. It is и 
to perform this approximation on п е 
probability paper, starting with the em p d by 
limens. Thus, the proportions indica с їп 
the best-fit line drawn for the top bs 50. 
Figure 1 are approximately .02, .16, a opor- 
Cumulative multiplication. of these p ich 
tions by the proportion of trials on о 
the corresponding stimuli, Were eri" 
(1/9 for the Rectangular distribution) И ion 
:076 as the value representing the pro 
of presentations judged with the top em 
gory. This is less than the suu Sei the 
proportion which is the reciprocal 0 cate- 
number of categories C167 for six there- 
gories). The frequency limen must limen. 
fore be lower than the empirical one 
To locate the top frequency TAE a 
shifts the best-fit line for the top smpn - 
limen to the left, without changing its we 48 
The proportions of the stimulus pre War 
tions are then noted for the new om 
If the cumulated proportion of all os 
tions is again less than the шагы stil 
Proportion, the line must be relocates the 
further to the left; if greater Lapeer 
equal-frequency proportion, it must Бе ЧА dis- 
back to the right. For the Rectangu S ПБУ 
tribution, the line for the top еШ; ог 
limen was finally located so that its jas 
limen value corresponded to виши ihe 
Thus, the normal curve that гере» the 
distribution of momentary values Сти" 
top frequency limen is centered over e was 
lus 8 in Figure 2. The same procedur nens. 
applied to locate the other frequency lim em- 

The model asserts that cach of the iate 
pirical limens is the mean of the ud em- 
frequency and range limens. Once e de- 
pirical and frequency limens have bee ute 
termined, the range limens can be er limen 
algebraically, Since the top шр. r the 
is 9 and the top frequency limen is 8 onding 
Rectangular distribution, the corresp 
range limen must be 10. : ег 

The relationship between the аш 
limens is shown at the bottom of Fg mens 
In this example, only the frequency timulus 
are symmetrical with respect to the 5 try 15 
values. The deviation from Sara the 
toward higher values for the pm ep 
empirical limens. By analogy to parative 
tive time-order error found for beh d 
judgments of visual size, one mig) nee pre- 
late that the subjects remember, were 
vious Presentations as larger than mney ented 
Or that subjects compare the or i 
Squares with either a larger McClelland, 
larger areas in the visual field ( 


ent 
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1943). Whatever the basis for this phe- 
nomenon, the range limens provide a quanti- 
tative index for the shifted frame of refer- 
ence. This index is free of the postulated 
bias toward equal use of the different cate- 
gories. 

„Frequency limens can be located for any 
distribution of these same nine stimuli, as- 
suming the same SDs but different locations 
for cach of the limens. Figure 3 shows the 
limens for an irregular distribution of pres- 
entation frequencies. Hypothetical normal 
curves are again located so as to mark off 
equal frequencies of category usc. For this 
distribution, the proportion of the area cut 
off by each stimulus must be weighted for 
the relative frequency with which the stimu- 
lus is presented. The frequency limens were 
again located by successive approximation, 
cach category including 1/6 of the presenta- 
tions. Consequently, these limens are most 
closely spaced in that part of the range 
where the stimulus values occur most fre- 
quently. 

The range limens are assumed to have 
the same values for any distribution of 
these nine stimuli, Each range limen for 
the Rectangular distribution can then be 
averaged with the corresponding frequency 
limen for any other distribution to predict 
the empirical limen for the other distribution. 
These predicted values can then be com- 
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STIMULUS: 1 4 
(2) (2) (2) (17) (0) (2) (2) (2) (6) 


FREQ./BLOCK: 


LIMENS: FREQUENCY—~ ! 1 
EMPIRICAL--I 1 1 
RANGE-I 1 | 


Fic. 3. Distributions of hypothetical val- 
ues of frequency limens and their relation- 
ships to empirical and range limens for Low- 
Median distribution. 


pared with the empirical limens. Or in- 
stead, as in Figures 2 and 3, the empirical 
limens can be combined with the frequency 
limens to calculate the range limens. Ас- 
cording to the model, the latter should have 
similar values for different distributions of 
the same stimuli. 

The stimulus-judgment matrix can be 
predicted for each distribution using the 
predicted empirical limens and the slopes 
fitted for the Rectangular distribution. The 
obtained frequencies can then be compared 
with these predicted frequencies. However, 
it is easier to evaluate a comparison based 
upon the predicted mean judgments. These 
are readily calculated from the predicted 
stimulus-judgment matrix. 


TABLE 1 


DISTRIBUTIONS OF STIMULUS PRESENTATIONS 


Stimulus number 1 2 3 4 5 6 7 8 9 
Width (in cm.) 5.4 67 8.3 10.5 13.3 16.2 18.7 | 207 | 232 
Distribution Stimulus frequencies 

Rectangular & | 5 5 5 5 5 5s | s 5 
Negatively Skewed 2 2 3 4 5 6 7 8 8 
Positively Skewed 8 8 7 6 5 4 Е 27 2 
Normal 2 3 5 8 9 8 е es 2 
U-Shaped 8 6 4 3 3 3 4 6 8 
Low-Median 2 2 2 17 10 2 2 2 6 
High-Median 6 2 D » | 10 | 4l 2 2 A 
Low-Midpoint 2 D 2 | 42 9 6 | i2 0 0 
High-Midpoint 0 0 12 6 9 12 2 ; i 
Low Midpoint-Median 2 » s | a3 5 5 7 7 0 
High Midpoint-Median 0 6 7 5 5 13 5 à 
Low-Mean 11 3 3 3 5 11 3 3 E 
High-Mean 3 3 3 11 5 3 3 3 
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JUDGMENT 


MEAN 


MEAN 


JUDGMENTS «— —. 
E-D VALUES 


=---— 


STIMULUS 


Fic. 4. Mean judgments and Thurstone 
equal-discriminability values for stimuli in 
Rectangular distribution. 


EXPERIMENTAL TESTS or THE Моркт, 
New Data 


Five different distributions of stimu- 
lus Presentations were used for the 
Same set of nine squares, Frequencies 
per block of 45 presentations are shown 


in the top half of Table 1. Each dis- 
tribution w 


bgroups. Pres. 
in two blocks of 45, 
nd block was tabulated 
so that attention is restricted here to 
judgments made after at least one ex- 
posure to the entire distribution, 


Only the seco 


The 
subject’s task was to judge each square 
in comparison with the other squares 


in the series. Judgments were made 
with the following six categories: 
“very small—1," “small—2,” “slightly 
smaller than average—3," “slightly 
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» 
larger than average—4,” B 
and "very large—6." — Additiona ни 
tails of procedure have been presen 
elsewhere (Parducci, 1963). sadi 

Figure 4 shows that the mean jud А 
ments are linearly related to the o 
order values of the stimuli when ^ o 
latter are presented with equal 4 
quency. The plot for the ‚шоп 
values (calculated assuming eq ТЕ 
stimulus and category dispersions) у= 
more negatively accelerated. mon 
ever, both functions suggest that es 
stimuli are spaced ae 
equally on the psychological scale n 
Size, as was assumed for the plots 
Figures 1, 2, and 3. jor 

Predicted values were calculated етй 
the mean judgments for the Negative | 
Skewed, Positively Skewed, se 
and U-Shaped distributions in Table еч 
Frequency limens were calculated bi a 
rately for each of these four distri ed 
tions. Range limens were аши 
using the data from the Rectangu s 
condition, as in Figure 2. The pr " 
dicted value for each empirical а 
is the mean of each pair of a 
and range limens. These means dme 
then used to construct the etel 
Judgment matrices from which the Ted. 
dicted mean judgments were er ce 
This procedure bases the pee 
solely upon the judgments of the = 
tangular distribution so that all p 
dictions could be made before "tot 
other distributions were presented 
judgment, А s 

The general trend of the задо 
in Figures 5 and 6 is successus p 
dicted by the model. The pre mie 
functions capture the overall wy rnt 
in the level of judgment for the Ske dif- 
distributions in Figure 5 and pe i 
ference in slope for the Normal an The 
Shaped distributions in Figure б; ross- 
model also predicts the point of сс à 
over for the two functions in Figu 


CATEGORY JUDGMENT: А MODEL 


Nevertheless, some of the errors of 
prediction are systematic. The stand- 
ard error for each empirical point, 
based upon the variability between the 
mean judgments of the same stimulus 
by the different subjects, is on the 
order of .1 category step. However, 
this overestimates the significance of 
the errors of prediction since the pre- 
dicted values are themselves based 
upon variable data from the Rectangu- 
lar distribution, and the different errors 
are not independent. 


Old Data 


The bottom eight distributions in 
Table 1 were studied 2 years earlier, 
and some of the mean judgments were 
reported before the present model was 
developed (Parducci, 1963). Figures 
7-10 show the predicted functions 
along with the actual mean judgments. 
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Fics.5-6. Mean judgments for г 


new distributions of stimuli, 
empirical points. 
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The model again captures the general 
relationship between the sets of means 
obtained for each pair of distributions. 
But again, some of the differences be- 
tween predicted and obtained values 
leave room for improvement. 


Minor changes in procedure were required 
for the derivation of these predictions. Al- 
though the experimental procedure was very 
similar, the judgments had been collected 
in a different laboratory. Only the Rec- 
tangular distribution was used in both lab- 
oratories. The category limens were higher 
for the earlier data using this distribution, 
and the mean judgments were up to 1/3 
category step lower. Consequently, the 
prediction for the distributions described in 
this section are based on the earlier data 
for the Rectangular distribution. Figures 1 
and 2 were also based on these data. 

These eight distributions had been selected 
to test an earlier formulation of the range- 
frequency theory that predicted differences 
for the middle limen when either the medians 
or midpoints of the stimulus distributions 
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predicted functions, and 
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10. Mean judgments for old distributions 


; 1 d 
dis 9f stimuli, predicted functions an 
empirical points, 


CATEGORY JUDGMENT: A MopEL 


меге different. The stimulus means аге 
identical for all but the last two of these 
distributions. Тһе medians differ by one 
stimulus step for the Low- and High-Median 
distributions, but both have the same mid- 
point (the mean of the two end values). 
The medians are identical but the midpoints 
vary for the Low- and High-Midpoint dis- 
tributions, and both measures vary together 
for the two Midpoint-Median distributions. 
Since the stimulus range was reduced by a 
factor of 7/9 for the Midpoint distributions 
and 8/9 for the Midpoint-Median distribu- 
tions, the hypothetical range limens were 
compressed by the same factors in order to 
make predictions from the model. For each 
truncated distribution, the lowest range 
limen was set slightly above the lowest 
stimulus (as in Figure 2), the same factor 
being applied to determine this difference 
and also each of the successive subranges. 

Only the stimulus means vary for the Low- 
and High-Mean distributions which were 
judged under somewhat modified instruc- 
tions. Subjects were required to record each 
presentation number, 1 through 90, in the 
column labeled with the appropriate cate- 
gory on their response sheets (Parducci, 
1963). Column recording raised the em- 
pirical limens so that the predictions for 
these two distributions were based on data 
from a separate group which had judged 
the Rectangular distribution under the modi- 
fied instructions. All predictions were cal- 
culated in accordance with the procedure 
used for the new data. 


Comparisons of Range Limens 


Empirical limens were computed 
from the judgments of each distribu- 
tion following the procedure described 
for Figure 1. These were combined 
with the frequency limens to calculate 
separate sets of range limens for each 
of the 13 distributions in Table 1. 
These range limens have similar values 
for all distributions with the same 
stimulus range, but they span a nar- 
rower range for the distributions whose 
stimulus ranges are narrower. The 
differences between successive, €m- 
pirically determined range limens, that 
is, the subranges, are proportional to 
the stimulus range. In general, the 
five limens in each set are equally 
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spaced. This suggests that the model 
might be simplified further by in- 
corporating both equal-subrange and 
equal-frequency principles. 


Weighting the Limens 

Taking the unweighted mean of each 
pair of range and frequency limens 
is an arbitrary simplification. How- 
ever, moderate changes in weighting 
would scarcely affect the predictions, 
and the errors of prediction were 
systematically biased toward neither 
the range nor frequency limens. A 
separate empirical weighting was de- 
termined algebraically for each fre- 
quency limen for each distribution, 
using the associated empirical and fre- 
quency limens and the range limen 
from the Rectangular distribution. The 
mean weighting of the frequency limen 
is .53 with a standard error of .05. 
The weighting might be expected to 
vary systematically with manipulation 
of other conditions such as the dis- 
criminability of the stimuli or the in- 
structions for judgment. 


EVALUATION 


It is clear that the distribution of 
stimulus frequencies has important ef- 
fects upon judgment. Тє is also clear 
that the range-frequency model de- 
scribes both the direction and the gen- 


eral order of magnitude of these ef- 


fects. How well does it compare with 
alternative interpretations of judg- 


ment? 


Adaptation-Level Theory 

The most ambitious and influential 
the problem of predicting 
the judgments of each stimulus has 
been Helson's theory of adaptation 
level (AL). The theory has been used 
to describe a wide variety of situations 
(Helson, 1964). Its first applications 
were to category judgments (Helson, 
1938, 1947), and it is with respect to 


approach to 
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the present type of data ае theory 
ad its most rigorous E | 
| yer assumption of AL rk 
is that the psychological effects of = 
stimulus context can be ko 
by a single stimulus value, AL. | е. 
is the value of the stimulus jud ge 
"average" (3.5 in the present task). 
The AL model that has been applied 
to psychophysical data (Helson, 1964, 
pp. 197-202) asserts that judgments 
are linearly related to the difference 
between each of the stimuli being 
judged and AL. This assumes that 
the stimuli have been Suitably scaled 
with respect to the psychological di- 
mension of judgment. The linearity 
of the judgment function for the Rec- 
tangular distribution (Figure 1) indi- 
cates that these squares have been 
suitably scaled. AL theory predicts 
that the other functions will be just 
as linear as the Rectangular function, 
To the degree that they are, AL the 
predicts the present data. . 
Although the empirical functions are 
all roughly linear, Straight-line fits 
would miss their other characteristic 
features. Thus, the functions. inter- 
twine for the Low. and High-Median 
distributions (Figure 7), The same 
intertwining has been found using 
different methods of recording judg- 
ments and also for such varying di- 
mensions as lifted weight and numer- 
Ousness of dots (Parducci, 1963). 
Linear fits would also miss the pinch- 
ing at the middle stimulus for the Low- 
and High-Mean distributions (Figure 
10). 


ory 


A second basi 


S for comparison is 
provided by the q 


ifferences in the value 
of AL for different distributions of 
stimuli. The Predictions for these 
differences are derived from the pool- 
ing feature of the theory which asserts 
that AL is a weighted mean of all of 
the stimuli affecting judgment, It 
can be assumed that the effects of 
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earlier stimulation and also of ced 
ground stimuli do not differ Sys dl 
atically for the present Mp eee js 
conditions. In this case, the egg 
AL model, the one that is "prt 
this type of situation. (Helson, rae: 
p. 132), describes AL as а Son 
function of the mean of the gg i eet 
of stimuli presented for esters 
With the exception of the up va 
and Positively Skewed and the с. 
and High-Mean distributions, the p 
ent distributions were selected so the 
their means were each 5.0 on iL 
stimulus scale, Contrary to the Ше 
model, the empirical ALs, that p^ 
stimuli judged 3.5; have the rend 
value for two of the писана 
with varying means (Figure a 
differ systematically for distri E: 8, 
with the same mean (Figures Pise 
and 9), py contrast, the aedi 
quency model correctly pua Sen 
the direction and general order о 
nitude of the differences in AL. -— 
Finally, a third basis for gere 
is provided by the slopes of the j AL 
ment functions, According to the i 
model, slope is determined by the d by 
ber of judgment categories an Dura 
other factors which were not m 
lated in the present research [Una NT 
1964, pp. 199-201). Thus, ; ob the 
model predicts that the slopes ia 10 
functions in Figures 4 throug "n 
Should all be equal. They ord 
The slope is greater for the inan 
than for the U-Shaped "vw peine 
(Figure 6) and for the m: А 
with narrower ranges (Figures ae 
9) than for those with wider getty 
(Figures 4, 5, 6, 7, and 10). much 
the Tange-frequency model does linear 
better, in this case capturing the wed 
component of each of these qum 
With respect to each of a A 
three bases for comparison, than the 
model proves less adequate tha 
Tange-frequency model. 


CATEGORY JUDGMENT: A MODEL 


Could another model be developed 
which would handle the present data 
and yet remain faithful to the general 
approach of AL theory? One possi- 
bility is that the simple weighted-mean 
model neglects important stimulus dif- 
ferences. As applied to other situa- 
tions, AL theory weights each of the 
series stimuli in direct proportion to 
its frequency of presentation. But the 
background, anchors, and other stimuli 
given special attention in the task for 
judgment are weighted in accordance 
with their observed effects upon AL. 
If greater weighting were given the 
extreme values of the regular series, 
the smallest and largest squares in the 
present distributions, the AL predic- 
tions would be greatly improved. This 
change in the model would handle the 
effects of range. It might also be 
Possible to find some system for 
weighting the stimuli which would ac- 
count for the frequency effect, but then 
the scale of judgment could not be 
characterized by the single value, AL. 

If the simple weighted-mean model 
for AL does so poorly with the present 
data, how has it proved useful in a 
wide variety of other situations? 
Were the present experiments differ- 
ent in crucial ways? The shifts in 
AL, shown here for squares, have also 
been reported for lifted weights and 
patterns of dots (Parducci, 1963), for 
lines which vary in length (Parducci 
& Marshall, 1961a), and for numerals 
presented in ordered arrays or in ran- 
dom succession (Parducci, Calfee, 
Marshall, & Davidson, 1960; Parducci 
& Marshall, 1961b). The general 
forms of the present judgment func- 
tions were also obtained with the other 
dimensions, indicating that they are 
relatively independent of the specific 


stimulus dimension. 
The contrast between the previous 
success and the present failure of the 


AL model may simply reflect differ- 
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ences in the precision of the data used 
to test the model. As reported by 
Helson (1964), “groups judging 
identical stimuli may differ, on the 
average, by as much as a whole cate- 
gory step [p. 136].” And the classic 
study of distribution effects (Johnson, 
1944) used only a single subject for 
each distribution. None of the studies 
cited as consistent with the AL model 
was sufficiently sensitive to demon- 
strate the deviations from the model 
reported here. 

Although the present study ques- 
tions the AL account of category judg- 
ment, it does not necessarily contra- 
dict the AL description of the effective 
stimulus for judgment. The prime 
value of the AL approach may be in 
specifying what is being judged. Thus, 
AL theory describes how the color 
of a sample changes with change in the 
reflectance of the immediate back- 
ground, and it appears to provide a 
useful account of what happens to the 
memory for a standard when stimuli 
are interpolated between standard and 
comparison stimuli. The present data 
reflect only upon its adequacy as a 
theory of category judgment. 


Conclusion 


Consider the differences between the 
AL and the range-frequency interpre- 
tations of category judgment. AL 
theory characterizes the state of the 
1 with a single value, AL, 


organism 
which varies with the stimulus con- 
text. Like the mean of a set of 


verage of different 
]t represents the 
uli forming 


scores, AL is an a 
stimulus effects. e 
pooled effect of all the stim 1 
the context for judgment. Each judg- 
ment is determined by the relationship 
between the judged stimulus and. AL. 
Therefore, the entire scale of judg- 
ment is determined by AL. Although 
e in its simplicity, this formu- 


attractiv| [ 
t for the evidence 


lation cannot accoun 
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that judgments may vary markedly 
even though there is no difference in 
AL (Figures 6 and 10) or that there 
may be differences in AL even though 
specific stimuli evoke the same judg- 
ments (Figures 5 and 7). The theory 
of AL cannot account for differences 
in the form of the judgment functions. 
The range-frequency theory pro- 
vides a more detailed characterization 
of the relationship between judgment 
and different features of the stimulus 
context. Special emphasis is given to 
the psychological representation of the 
stimulus range, the effective stimulus 
extremes. Although these clearly de- 
pend upon the extreme values of the 
physical distribution, further research 
is needed to clarify this dependency. 
The range-frequency theory also em- 
phasizes the frequencies with which 
different categories are used. The 
present model assumes that there is a 
response tendency toward equ 
of the judgment categories s 
prediction of the judgments 
in part upon the frequencies wi 
stimuli are presented from e 
of the range. Instead of comparing 
each stimulus with а single value, AL, 
the subject Compares each stimulus 
with a set of category limens, This 
set is itself an average of two other 
sets, the hypothetical range and fre- 
quency limens. The subject could not 
describe these hypothetical limens, but 
his judgments are made as though he 
were compromising between them. 
The advantage of th 
quency approach is that 
the effects of v 
frequency for th 
available. These 


al use 
O that 
depends 
th which 
ach part 


е range-fre- 
it describes 
ariation in stimulus 
€ most reliable data 
contextual effects i]. 
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lustrate the nature of category judg- 
ment, and that is what the theory of 
adaptation level was designed to 5 
plain. "The relative success of the pres- 
ent model indicates that the range- 
frequency theory provides a more ade- 
quate account of category judgment. 


REFERENCES 
HeEtson 


H. Fundamental problems in color 
vision: I. The principle governing changes 
in hue, saturation, and lightness of non 
selective samples in chromatic illumina- 
tion. Journal of Experimental Psychology. 
1938, 23, 439-476. É 

HeEtsox, Н. Adaptation-level as frame Оп 
reference for prediction of psychophysica 
data. American Journal of Psychology, 
1947, 60, 1-29. 

Hetson, Н. Adaptation-level theory. New 
York: Harper & Row, 1964. le 
Јонхѕох, D. M. Generalization of a und 
of values by the averaging of practice A 
fects. Journal of Experimental Psycho 
ogy, 1944, 34, 425-436. Р 
McCrgtLANp, D. C. Factors ifene 
time error in visual extent. Journal of H 
perimental Psychology, 1943, 33, 81-95. Е 
ARDUCCL А. Range-frequency фора 
in judgment. Psychological Monographs, 
1963, 77(2, Whole No. 565). lb 
Рлкрисст, A., CALFEE, R. Су MARSHAL 
L. M., & Davipsox, L, Р. Context go 
in judgment: Adaptation level as а = d 
tion of the mean, midpoint, and med 
of stimuli. Journal of Experimental Psy 
chology, 1960, 60, 65-77. ЕЯ 
Parpucct, A., & Mansuau, L. M. Ср 
effects in judgments of length. Aen 
е of Psychology, 1961, 74, 576-585. 

a) 


Panbucct, A, & Marswatt, L. M. Supple- 
mentary report: The effects of the mea ч 
midpoint, and median upon айарын 
level in judgment, Journal of ДАР 
mental Psychology, 1961, 61, 261-262. 
(b) 


Toncrnsox, W. 


S. Theory and methods of 
scaling. 


New York: Wiley, 1958. 


(Received September 22, 1964) 


Psychological Review 
1965, Vol. 72, No. 6, 419-444 


PERCEPTUAL ADAPTATION TO INVERTED, 
REVERSED, AND DISPLACED VISION * 
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Recent research has shown that a simple fo: 
sual field consists primar! 


e—a change in the felt position of the arm seen 
visual, motor, or visuomotor change. 
ion (to inversion, reversal, and other 
Iso be understood as resulting from 


produced displacement of the vi 


proprioceptive chang 
through prisms—rather than a 
More complex sorts of adaptat 
optical transformations) can a 
changes in the felt locations of раг 
parts. Contrary to the usual empir: 
be very stable, 
When the 2 senses p 
sense that changes. 


rovide discrepai 


i For over a century, psychologists 
wis been experimenting with optical 
evices that displace, reverse, or in- 
Pes the retinal image. When a per- 
Era first puts on such a device, he 
наа _things he reaches for and 
a into things he is trying to walk 
me But after a while he adapts. 
à € ends up behaving normally despite 
ne optical distortion. 

Typically, experimenters have ac- 
cepted this adaptation as evidence for 
RE against various theories about the 
nee of visual space perception in the 
" ant. But even if one hesitates to 
рпеш from adult behavior to in- 
development, adaptation to opti- 
à distortions is of interest in reveal- 
ЗЕ how perceptual-motor systems 

ork and how they can be modified. 
Р рй in part оп а doctoral dissertation 
ubmitted to the Department of Psychology, 
ane University. Preparation of this 
ine was supported by NIMH Grant MH- 
Hat and NSF Grant GB-3546. Some of 
NI research cited was supported by NSF and 
N MH predoctoral fellowships and by an 
t SF postdoctoral fellowship. I am grateful 
ns Charles R. Hamilton, Judith К. Harris, 
qued Held, Alice Isen, R. Duncan Luce, 
acob Nachmias, Fred Stollnitz, and Benja- 
min W, White for their helpful criticisms and 
Suggestions. 
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whereas the position sense 


rm of adaptation to prism- 
ily of a 


ts of the body relative to other 
icist assumption, vision seems to 
is remarkably flexible. 


nt information, it is the position 


Recently there has been much con- 
cern with the mechanisms for adapt- 
ing to optical distortions and with the 
conditions that are necessary for such 
adaptation to take place. Less atten- 
tion has been given to the end product 
of adaptation. What change does the 
adaptation procedure produce in the 
subject? How does the adapted sub- 
ject differ from one who has not 


adapted ? 
Previous investigators have offered 
For 


diverse answers to this question. 
example, Kohler (1964) and Taylor 
(1962) believe that adaptation results 
in a change in visual perception. 
Smith and Smith (1962), on the other 
hand, claim that it consists mainly 
of learning specific motor responses. 
Held and Freedman (1963) say that 
adaptation "represents à change in 
state of the relevant sensorimotor con- 
trol system" based on the storage of 
"newly correlated information" de- 
rived from "the one-to-one relation 
between movement and its sensory 
feedback [p- 457]-" 

This paper proposes another inter- 


pretation of adaptation : that it con- 
es in the position sense 


sists of chang: 
for various parts of the body. 
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change in position sense has been 
clearly demonstrated in one form of 
adaptation to displaced vision. The 
extension of this interpretation to 
other forms of adaptation is more 
speculative but seems to make sense 
out of a mass of otherwise perplexing 
data. (For summaries of earlier ex- 
perimental work on adaptation see 
Held & Freedman, 1963; Kohler, 
1964; Smith & Smith, 1962; Taylor, 
1962.) 


The Position Sense 


Even in the dark we can perceive 
the relative locations of the various 
parts of our bodies. The sense that 
enables us to do this will be referred 
to as the position sense, and the per- 
ception of the position will be called 
a felt position. Changes in the posi- 
tion sense will be called, for want 
of a better adjective, proprioceptive 
changes. (The term kinesthesis will 
be restricted to the perception of move- 
ments of parts of the body.) 

The position sense is a psychological 


phenomenon; its physiological basis 
has not yet b 


lished. Recep 
to play the major role (Rose & Mount- 
castle, 1960) ; 
ity may enhan 


: 1962b; Lud- 
vigh, 1952). Jf motor signals do play 


this role, though, the nervous system 
must somehow register the positions 
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called for, not the movements; other- 
wise we would lose track of body parts 
whenever they were moved (or kept 
from moving) by an outside force. 

Although information registered by 
the position sense is usually available 
to introspection, we are not constantly 
aware of the locations of all of our body 
parts. And sometimes a subject’s Lp 
scious perception of the positions 0 
some body parts (especially his eyes) 
is vague and variable, even though 
there is abundant behavioral evidence 
that these positions are being Eng 
into account" accurately. In genera, 
the hypotheses presented in this paper 
apply whether the position informa- 
tion is conscious, potentially conscious, 
or not available to consciousness. 


L 
ADAPTATION TO DisPLACED RETINA 
IMAGES 

Arm Adaptation 


Adaptation to inversion or reversal 
of the visual field may take many бау 
or even weeks. However, as pe 
holtz reported in 1866, a person m 
adapt to sideways displacement of ae 
visual field in just a few minut 
(Helmholtz, 1962b, p. 246). dis- 

If you look through prisms that a 
place the apparent locations of ud 
Objects to the right, for example, ж 
try to reach quickly for somethit to 
you will miss it by reaching too far re 
the right. But after just a few i 
attempts, your aim will improve o 
siderably,” When the prisms are p 
removed, however, you will reach th 
far to the left. For convenience, du 
the improved reaching while agere 
Prisms and the aftereffect when go 
are removed will be referred to ew 
adaptation (ie. adjustment to a 
Conditions), since they are e Ше 
ly manifestations of a single "айар" 
lying change. The amount ot icate 
tation (the adaptive shift) is pow 
by the difference between a subj 
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responses on pre- and postadaptation 
tests. (During these tests the sub- 
ject must not be allowed to see his 
hand; otherwise, by moving it slowly 
and guiding it visually, he would al- 
ways be able to point correctly.) 

Proprioceptive Changes. If a per- 
son’s eyes are closed when he first 
puts on displacing prisms, he is sur- 
prised when he opens his eyes and 
looks athis hand. Because the prisms 
Shift its visual image, his hand does 
not appear to be where he felt it was. 
If the discrepancy between the seen 
and felt locations of the hand is to be 
eliminated, either the person's visual 
perception or his position sense (or 
both) must shift. 

According to the proprioceptive- 
Change hypothesis, the subject comes 
to feel that his arm is where he saw 
it through prisms—even though this 
makes that arm's position sense er- 
roneous (nonveridical). That is, after 
Such a change, the subject's judgment 
of that arm's position relative to any 
Other part of the body will be incorrect. 
lf the prisms are removed and the 
Subject tries (without seeing his 
hand) to reach for a target that he sees 
in a certain place, he will move his 
hand until he feels that it is in that 
place—but it will actually be off to 
one side of it. The same thing will 
happen if he tries to point at a sound 
Or simply to point straight ahead. 
Only when judging the whereabouts 
of his hand relative to objects seen 
through prisms will he be accurate. 

It is not clear a priori whether a 
Proprioceptive shift would make a sub- 
ject misperceive arm positions other 
than those he saw while adapting. 
Since neurons in the proprioceptive 
System have rather large receptive an- 
gles ( Mountcastle, Poggio, & Werner, 
1963), a change in the operating level 
of proprioceptive neurons in the cen- 
tral nervous system might exert an 
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effect over a wide range of arm posi- 
tions. At any rate, the presence or 
absence of such a shift should depend 
mainly on the actual position of the 
arm, not on the movements by which 
the position was reached. 

Other Interpretations. Five other 
simple, plausible conceptions of the 
nature of adaptation can also account 
for the rapid improvement in reaching 
for objects seen through prisms. Each, 
however, suggests a different set of 
predictions about other behavior. 
These five conceptions, which are 
often implicit rather than explicit in 
previous investigators’ writings, have 
been presented in greater detail else- 
where (Harris, 1963a). They are 
described briefly below, together with 
some of their predictions about a sub- 
ject who adapts by pointing with one 
arm, using a stereotyped arm move- 
ment, at a single target seen through 
prisms. 


1. Conscious correction of one's 
aim. When the subject misses the 
target, he realizes that the prisms are 
deceiving him about the target’s loca- 
tion and so deliberately aims to one 
side of visual targets; when the prisms 
are removed, he goes back to pointing 
normally. 

2. Altered visual perception. A 
changed translation from retinal image 
to perception makes a target which at 
first looked off to the side appear to 
be straight ahead. This new percep- 
tion can be demonstrated by any ap- 
propriate judgment of, or response to, 
a visual target seen with or without 
prisms. 

3. Reorientation of the perceptual 
frame of reference. Perception of all 
external stimuli, visual or auditory, 15 
shifted to one side; perception of the 


however, is unaffected (if per- 


arms, í 
shifted too, the 


ception of the arm 
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TABLE 1 


Test PERFORMANCE PREDICTED By Six INTERPRETATIONS 
OF ADAPTATION TO DISPLACED VISION 


Propriocep- 
tive change 
in the arm 


Conscious 


Test task correction 


| 
Visual | Frame of 


Visuomotor Motor 
perception 


reference | recorrelation learning 


Same as during adap- + + 
tation® 

Pointing at visual tar- + 
get without prisms 

Pointing at visual tar- + 
get with unexposed 
hand 

Verbal judgment of ? 
location of visual 
target 

Pointing at auditory + 
target» 

Verbal judgment of 
location of auditory 
target» 

Pointing straight + 
ahead» 

Pointing at visual tar- + + 
get with different 
arm movements 

Pointing at visual tar- ? 
gets in different 
locations 

Judgment of distance + 
between hands 

Judgment of location + 
of Passively moved 
adapted arm rela- 
tive to visual target 

Pointing with adapted + 
hand at unexposed 
hand» 

Pointing with unex- + 
posed hand at 
adapted hand> 


+ + + s: 
4 4 + P 


Note.—The subject adapts by 


Я Pointing wi i " i target 
seen through prisms, A + indicates the prediction drm, naan & stereotyped arm movement, at a single 


used during adaptation. 


^ Except that (as with 
his accuracy. 


^ While blindfolded. 


subject would show no adaptive shift 
in pointing at targets), 

4. Visuomotor Tecorrelation, Vis- 
ual perception does not change, but a 
given visual input is paired with a 
different motor output. Since only the 
visuomotor system used during adapta- 


all the other tests) the subject cai 


4 f 1 : task 
ive shift as large as that obtained with the 


и 2 ” bout 
nnot see his hand and receives no information al 


. a nd 
tion is altered, the unexposed arm а! 


all nonvisual targets are unaffected. 
KA Motor-response learning. The 
practiced arm acquires a new moto 
response to stimuli from a given spr 
tial location regardless of their a 
dality. There is a generalization 
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crement when the subject uses arm 
movements that differ from the prac- 
ticed one. 


Table 1 summarizes the predictions 
of the proprioceptive-change hypothe- 
sis and the other five conceptions. 
Several other, more sophisticated theo- 
ties are discussed briefly at the end 
of this paper. Unfortunately, these 
theories often make equivocal predic- 
tions, or none whatever, about many 
of the tests listed in Table 1. 

Experimental Findings. Harris 
(1963a, 1963b) carried out six of the 
tests listed in Table 1. The subjects, 
whose heads were held stationary by 
a bite board, adapted by pointing for 

minutes at a visual target seen 
through prisms that displaced its 
image 11° to the right or left. Adap- 
tation was found to produce sizable 
and significant adaptive shifts, which 
Were virtually identical whether meas- 
ured by pointing at visual targets, at 
auditory targets, or “straight ahead.” 
The shift was no smaller when sub- 
Jects pointed at targets several inches 
from the one they had practiced on, 
even though the arm movements used 
then differed from the practiced one. 
However, adaptation had little or no 
per on pointing with the unexposed 
"and. Nor did it affect judgments 
of whether a given auditory target 
Pa straight ahead. (Hein & 
"n 1960, had previously reported 

at, with a similar adaptation proce- 
dure, there was no change in judged 
location of visual targets.) Others 
have independently demonstrated the 
Adaptive shift with auditory test tar- 
Sets and with pointing straight ahead 
(Pick, Hay, & Pabst, 1963) and the 
absence of any shift in pointing with 
the- unexposed hand (Н. B. Cohen, 
1963; Hamilton, 1964a; Mikaelian, 
1963; Scholl, 1926). ^ Subsequent 
Studies have also confirmed these three 
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findings (Goldstein?; Hay & Pick, 
in press; McLaughlin & Bower, 1965; 
McLaughlin & Rifkin, 1965). 

On the basis of these results, five 
of the notions listed in Table 1 may 
be ruled out. The data can be ac- 
counted for only by the first interpre- 
tation: that adaptation consists of a 
change in the felt position of the 
adapted arm relative to the rest of the 
body. 

Further Tests. The proprioceptive- 
change interpretation. implies that a 
subject should make errors in judging 
how far his adapted hand is from other 
parts of his body—for example, his 
other hand. This prediction was tested 
by having subjects (who had adapted 
their right arms by pointing at a tar- 
get seen through prisms) move their 
unexposed hands to specified subjec- 
tive distances from their adapted hands 
while blindfolded (Harris, 1963a). 
After seeing their right hands shifted 
to the right by base-left prisms, sub- 
jects felt their hands to be farther 
apart, at a given physical distance, than 
when their hands were not adapted. 
Subjects who wore base-right prisms 
felt their hands to be closer together. 
These results demonstrate that there is 
in fact a change in the felt location of 
the adapted hand relative to the other 
hand. 

During these tests, the subject was 
not allowed to make any active move- 
ments with his adapted arm. Thus, a 
simple motor-learning or conditioned- 
response theory of adaptation is inade- 
quate: Although self-produced move- 
ments may be an essential precondition 
for adaptation, they are not a neces- 
sary part of the end product. The 
adaptive shift is evident whether the 
subject actively points at a target dur- 
ing the test or a luminous target 15 
moved until he says it is right over his 

2 Donald Goldstein, personal communica- 
tion, June 1964. 
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stationary hand (Hamilton & Hill- 
yard?). A change in the position 
sense will indeed affect motor re- 
sponses of the adapted arm, but the 
change does not itself consist of newly 
acquired motor habits. On the con- 
trary, it is a perceptual change (in the 
felt position of the adapted arm). 

The most direct evidence for the 
hypothesized change in position sense 
is obtained when the subject points 
with his unexposed arm (which points 
correctly at all other targets) at 
his stationary adapted arm. Harris 
(1964) results fell just short of sig- 
nificance, but more recently Efstathiou 
and Held (1964) and Goldstein ? have 
found large and significant shifts on 
this test, as well as on pointing with 
the adapted hand at the unexposed 
hand. Both findings were anticipated 
by Scholl (1926). 

Related Findings. Several other re- 
cent studies fit in well with the proprio- 
ceptive-change hypothesis, Bossom 
and Hamilton (1963) and Hamilton 
(1964b) found that adaptation to dis- 
placed vision—in contrast to visual 
discrimination learning—shows com- 
plete interocular transfer and no inter- 
manual transfer in split-brain mon- 
keys; the adaptation is specific to the 
arm, not the eye, Nielsen (1963), 
Hay, Pick, and Ikeda (1965), Rock 
and Victor (1964), and Wertheimer 
and Arena (1959) have demonstrated 
that vision may immediately and com- 
Pletely dominate the position sense 
when the two disagree, a finding anal- 
ogous to the smaller but longer lasting 
adaptive shifts discussed above, 

All in all, it Seems reasonable to 
conclude that, when a Person watches 
one hand through prisms with little 
head movement, the adaptation is 
mainly a change in the felt position 
of that arm relative to the rest of his 


3 Charles R. Hamilt 


milton and S, A. Hillyard, 
personal communicati 


On, August 1964. 
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body. Although their own data did 
not rule out all alternative hypotheses, 
other investigators have independently, 
at about the same time, reached similar 
conclusions: that such adaptation takes 
place in the “kinesthetic spatial kd 
tem" (Hochberg, 1963; Pick et a 
1963) or, more specifically, in ds 
adapted arm's position sense (Найш- 
ton, 1964a, 1964b). А similar [^ 
pothesis was proposed earlier by 
Scholl (1926). 


Head-Body Adaptation 


Another way to adapt to displace? 
vision is simply to walk around - 
wearing prisms (Hay & Pick, е. 
press; Held & Bossom, 1961; Koh em 
1951, 1964; Taylor, 1962). The p^ 
sults are quite different from those ih 
arm adaptation. When presented ni 
a visual target after the prisms are С 
moved, the subject points incorrec 7 
with both arms, even if he saw om 
one through prisms (Bossom & pen 
1957), and says the target 100 es 
Straight ahead of him when it is eld 
tually somewhat off to one side (He 
& Bossom, 1961; Kohler, 1964). " 

Is this type of adaptation, tien 
completely unlike arm абаран 
Probably поё, Just as the felt neon 
ship between arm and body is a 
by moving the arm while Маи 
prisms, so perhaps the fett uen 
Ship between head and body is pere 
by moving the head while mp 
prisms. The three investigators hesis 
independently proposed this hypoth 


: dapta- 
*It is convenient to think of arm a f the 


tion as a change in the felt position x the 
adapted arm, with that of the rest er, the 
body remaining unchanged. Нона if the 
Same phenomena would be observe e and 
Perception of the arm stayed the eap . 
that of all of the rest of the body “els, 
Strictly speaking, we can detect o. This 
changed relationship between the tw ,d-body 
is even clearer in the case of hea 
adaptation. 


ADAPTATION то DISTORTED VISION 


—Hamilton (1964a), Harris (1963a, 
1963c), and Mittelstaedt (1964)— 
were unaware that Kohler (1951, p. 
23) had already observed just such 
a phenomenon: A subject who wore 
prisms developed the “habit” of hold- 
Ing his head turned 6?-9? to the right 
of his body midline but was “com- 
pletely unaware" of the deviation. He 
felt that his head was pointing straight 
ahead, 

_ It is the unawareness, not the turn- 
ing, that is crucial. Contrary to Smith 
and Smith's claim (1962, pp. 92, 116- 
117), a “compensatory reaction” of 
turning the head cannot in itself coun- 
teract the prism-produced visual dis- 
Placement: The perceived location of 
an object (relative to one’s body) 
does not normally change when one 
turns one’s head, because the new 
Position of the head is taken into 
account, Perception of the object 
changes only if a subject misperceives 
the orientation of his head. 

Whereas misperception of just one 
arm affects only tests that involve that 
arm, virtually any test will show the 
effects of misperceived head orienta- 


А MEL Prismentrüger ist man stündig gez- 
MET Auge und Kopf gegenüber der 
ТОРЕНИН etwas verdreht zu halten. 
did in der Tat liess sich als Nachwirkung 
aie a aufgeswungenen ‘Lebensgewohnheit’ 
aid fuer kbar Verdrehung zwischen Kopf 
Auf umpf nachweisen, welche aber der 
por nerksamkeit | der Vp. vollkommen 
аў ging. Sie meinte gerade und unverdreht 
К. Stehen, während sie in Wirklichkeit den 
2001 nach rechts gedreht hielt (6-9 Grad 
we der Kérpermediane abweichend). Kor- 
igierte man aber die Rechtslage des Kopfes, 
$o entstand im Erleben der Vp. der Eindruck 
38] Linksverdrehung (cf. Kohler, 1964, p. 


The Fiss and Gleitman translation 
(Kohler, 1964) differs, in many places, from 
the German papers on which it is based 
(Kohler, 1951, 1953). Though usually 
slight, the discrepancies are sometimes mis- 
leading at crucial points. Therefore, the 
Present writer’s translations are occasionally 
&iven instead. 
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tion. If a subject feels his head to 
be pointing straight ahead of his body 
when it is really somewhat turned, 
then when he sees an object directly 
in front of his nose he will incorrectly 
Gf he is not wearing prisms) perceive 
that object to be straight ahead of his 
body. If he tries to point at it with 
either hand, he will point straight 
ahead of his body and thus point in- 
correctly. (Such misperception of 
head position would, of course, lead 
to improved accuracy of performance 
while the prisms are on.) Similar re- 
sults will occur even if the test ap- 
paratus constrains the subject to hold 
his head straight relative to his body, 
as in Held and Bossom’s (1961) pro- 
cedure. When Kohler forced his sub- 
ject to point his head straight ahead, 
the subject felt that it was turned sev- 
eral degrees to the left (1951, pp. 23- 
24). 

A change in the felt relationship 
between head and body necessarily en- 
tails a change in the perceived direc- 
tion of visual targets relative to the 
body. But it would be inaccurate to 
describe such adaptation as solely a 
change in visual perception, since, for 
example, altered perception of head 
orientation would also result in altered 
auditory localization. 

Intermanual Transfer. A number 
of investigators have found that if a 
subject watches one hand through 
prisms, with little head movement, the 
adaptation is completely or almost 
completely confined to the exposed 
hand. Helmholtz (1962a, p. 157), 
however, reported considerable adap- 
tation of the unexposed hand as well. 
How can these findings be reconciled ? 

A plausible answer was suggested 
independently by Hamilton (1964b) 
and Harris (1963a, 1963c). They 
both noted (as did H. B. Cohen, 1963) 
that subjects whose heads were im- 
mobilized while they adapted showed 
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little intermanual transfer, whereas 
those who were free to move their 
heads, as Helmholtz was, exhibited 
considerable transfer to the unexposed 
hand. Hamilton and Harris concluded 
that moving the head while wearing 
prisms leads to a change in the felt 
position of the head relative to the 
body, which would make the subject 
mispoint with both hands even if he 
never saw them through prisms. If 
he did see one arm through prisms, 
he would show a larger aftereffect 
with that arm than with the unexposed 
arm (since, in addition to the error 
caused by misperceiving the orienta- 
tion of his head, there would also be 
a misperception of the exposed arm's 
orientation), thus manifesting "partial 
intermanual transfer." 

Tf this analysis is correct, the term 
intermanual transfer. is, in this con- 
text, something of a misnomer. Trans- 
fer implies that the adaptive change 
in one arm (or relevant parts of the 
nervous system) somehow spreads to 
or induces a similar change 
other arm (or contralateral part 
nervous system), 
sibility has not be 
out (as Hamilton, 


in the 
of the 
Although this pos- 
en definitely ruled 


t 1964b, noted), it is 
simpler to assume that the measured 
adaptation in the unexposed arm re- 
sults wholly from head-body adapta- 
tion, which affects both arms equally, 
and that there is in 


| addition some arm 
adaptation of the exposed arm. 
Wooster's Experi 


iment. The concept 
of altered position sense of the head 


О never saw their hands 
through prisms and so had no 
edge of error” 


accurate, Wi 
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ber of possible reasons for the pd 
provement, tested them in further e 
periments, and found that none fitte 
all of her data. | 

Although Wooster's subjects did not 
walk around while wearing. pun 
they were free to move their hea a 
and so could have undergone ант 
body adaptation, Had Wooster an 
the unpracticed arm, she might po 
been even more surprised to find [^ 
it had improved just as much as th 
practiced one. . н 

A change in the propriccepiivey 
perceived relationship between ipae 
and body could also account for air 
of the findings reported by шнен 
Kravitz, and Lindauer Cee 
Bossom (1964), and by pang ae 
and Rifkin (1965), as well as for | 
aftereffects of the incidental veiis 
displacements produced by Stratton, 
(1897, p. 471) and Kohler’s (196% 
p. 32) inverting goggles. 


Eye-Head Adaptation 


All of the phenomena ascribed me 
to head-body adaptation (except ge 
Kohler's direct observation of a iar 
in felt head position) might Hp 
well be due to a change in the reg es 
tered relationship between the са à 
and the head (Harris, 1963a) the 
modification of the “judgment в" ltz 
direction of the gaze,” as m 
(1962, p. 246) put it. Indes 
Kohler (1964, р. 32) says that in По 
experiments alterations in "kinest ften 
sensations” from the eyes were O 
encountered. з eye- 

Unlike head-body adaptation, сае 
head adaptation would not affect sho 
tory localization. Thus, a subject “his 
misperceived the orientation T 
eyes should misperceive the pe v 
of a visual target relative to pert 
а Sound from an unseen 5 have 
(Harris, 19632). Recent cere oe 
demonstrated just such an au 
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visual mismatch. After certain adap- 
tation procedures, a subject errs in 
judging where on a luminous visual 
scale a sound is coming from (Hay & 
Pick, in press; Wallach & Bernheim 5). 
He points in one direction at a light 
and in another at a sound which is 
actually in the same place (Hay & 
Pick, in press; McLaughlin & Bower, 
1965). 

These and other findings reported 
by Hay and Pick (in press), Mc- 
Laughlin and Rifkin (1965), and 
McLaughlin and Bower (1965), as 
they acknowledge in their later papers, 
are all attributable to a change in reg- 
istered eye position plus a change in 
the exposed arm's position sense, with 
the amounts of the two changes vary- 
ing during the course of adaptation." 

Because incorrect registration of eye 
Position would entail incorrect locali- 
zation of all seen objects, one can say 
that eye-head adaptation alters visual 
perception. But this alteration is fun- 
damentally different from purely vis- 
ual modifications such as dark adapta- 
tion and localized figural aftereffects. 
It is more akin to altered position 
sense in the arm or head. Thus it 
Seems inadvisable to make a sharp dis- 
tinction between “proprioceptive adap- 
tation" (of the arm) and “visual adap- 
tation.” ` Such a distinction might, for 
example, lead to a fruitless search for 
modifications in the pathways connect- 
ing the retina to the visual cortex. 

Half Prisms. Prisms that cover 
only the upper half of the eye displace 
the upper half of the visual field rela- 
tive to the lower half, making а 
Straight vertical line look discontinu- 
ous. Kohler (1964) reported that 

? Hans Wallach and Joseph Bernheim, per- 
sonal communication, December 1963. 

7 Part of the shift that Нау and Pick (in 
Dress) attributed to arm adaptation may 
actually be due to head-body adaptation; Hay 
and Pick's tests do not distinguish between 
the two. 
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subjects who adapt to half prisms say 
that the line eventually looks straight 
and unbroken most of the time despite 
the discontinuous retinal image. 
Although this adaptation sounds like 
a purely visual change—a change in 
perceived relationships within the vis- 
ual field—Kohler’s other observations 
indicate otherwise. When an adapted 


subject was asked to move his eyes 
straight up and down in the dark, 
Kohler says, the subject actually 


moved them in a jagged line, with a 
sideways jump approximately in the 
middle of the movement (“einen seit- 
lichen. Sprung ungefähr in der Mitte 
der Bewegung"—Kohler, 1951, p. 73; 
сї. 1964, p. 93). With more rapid 
eye movements, the path became di- 
agonal. But the subject "always 
thought that his eyes moved vertically 
and without sudden deflections [1964, 
р. 94].” Apparently, the subject per- 
ceived a broken line as straight only 
because he felt that the jagged eye 
movement he made in scanning it was 
straight. What happened when the 
subject fivated the dividing line be- 
tween the prism and nonprism areas? 
Kohler (1964, p. 83) says explicitly 
that, when fixated, vertical lines looked 
just as discontinuous after many days 
of adaptation as they had at first. 
Clearly, there was no change in the 
purely “pictorial” aspect of visual per- 
ception, but only in those perceptions 
of visual location that depend on the 
registration of positions and move- 
ments of the eyes. Note that the adap- 
tation did not entail any change in 
scanning behavior: When scanning 
the discontinuous line, the subject 
made essentially the same eye move- 
ments after adapting as he had before. 
The only change was that a jagged 
eye movement Was interpreted as 
straight. This is a perceptual change, 
not the acquisition of new motor re- 
sponses. 
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Curvature. Straight vertical lines 
look curved when viewed sees a 
sideways-displacing prism because the 
prism displaces the top and bottom of 
the visual field more than the middle. 
The curvature is a set of relative dis- 
placements of the same sort as pro- 
duced by a half prism. So perhaps 
adaptation to curvature also involves 
altered registration of eye movements 
without any change in scanning be- 
havior. After adapting, the subject 
may feel that his eyes are moving in 
a straight line when they are actually 
tracing out a curve.® Perhaps the “un- 
stable aftereffect” experienced by one 
of Kohler’s subjects (“straight ob- 
jects—for example, long and heavy 
steel pipes—curved and straightened 
out while the amazed subject was in 
the very act of looking at them"— 1964. 
pp. 37-38) was due to al 
ning and fixation, 

An analogous case of curvature adap- 
tation, resulting from altered kines- 
thetic perception of movements of the 
arm, was recently studied in collabora- 
tion with Judith R, Harris (Harris, 
1964 Subjects moved one hand 
back and forth along a horizontal 
Straight line while looking through 
prisms that made the line look curved 
upwards or downwards. Before this 
Practice and again afterwards, they 
Were asked to draw straight horizontal 
lines while blindfolded. The predic- 
tion was that if a subject ran his hand 
along a straight line that looked curved 
upward, for example, a straight hori- 
zontal arm movement would come to 
feel curved upward, so the subject 
would compensate and draw a down- 
ward curve in order to feel that his 


ternate scan- 


5 This idea was 
with Julian Hochb: 


ast two 
the registered eye- 


n applies to only one 
kind. 
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arm was moving in a straight aee 
A significant shift, in the predicte 
direction, was found. Note that Ee 
shift cannot be due to an intermoda 
figural aftereffect, because the shift с 
in the wrong direction; it cannot be 
motor learning, because the arm move- 
ments made during practice were ac- 
tually straight, 


ADAPTATION TO INVERTED RETINAL 
IMAGES 


Is a proprioceptive-change pags d^ 
tation appropriate when subjects adapt 
to optical transformations more Aa 
tic than displacement? Stratton 
(1896, 1897) reports on his adaptation 
to “reinversion” of the retinal image 
indicate that the answer is yes. 


Stratton’s Experiments 


Proprioceptive Changes. жы A 
reports are indeed difficult to compa 
hend—at times they sound bizarre, EB 
times, self-contradictory. But it iy 
clear that Stratton experienced ie 
Prioceptive changes similar to ne 
considered above, though far more ех 
tensive and less stable. 


When he first looked through v 
verting lenses, Stratton (1896) sa) 


lie 
tic the parts of my body were felt to 


where they would have appeared had he 
instrument been removed; they were ictus 
be in another position. But the older pape 
{+ localization was still the real loca 
tion [p. 614]. 

Soon, however, in the 
«+. the limbs began actually to feel. in Е 
place where the new visual Dacus es 
Ported them to be... . The seen борта 
thus became real things just as in eee 
Sight. I could at length feel my feet 5 was 
against the Seen floor, although the foor Son 
scen on the opposite side of the field o f the 
from that to Which at the beginning sensa- 
experiment 1 had referred these tactual E my 
tions, T could likewise at times feel ча new 
arms lay between my head and bae , 
Position of the feet; shoulders anc tances 
however, which under the euo ol 
could never be directly seen, kept 
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localization they had had in normal vision, 
In spite of the logical difficulty that the shape 
of the body and the localization of hands and 
feet just mentioned made such a localization 
of the shoulders absurd [p. 615]. 


Proprioceptive changes such as these 
account for the behavioral aspects of 
Stratton’s adjustment to inverted vi- 
sion.” [f the felt locations and move- 
ments of his hands and feet came to 
agree with their seen locations and 
movements, he would have no trouble 
reaching for or kicking things, whereas 
before adapting he had to move the 
limb in a direction that felt wrong. 
When the new proprioceptive percep- 
tions became stable enough to persist 
€ven when the limb was out of sight, 
Tesponses with that limb would be 
Completely normal with no need for 
Conscious deliberation. 

These proprioceptive changes also 
explain Stratton’s feeling that he had 
achieved a “reharmonization” of touch 
and sight: Whenever he touched an 
object with an adapted limb, he felt it 
to be where he saw it, because he felt 
the limb to be in a new location that 
agreed with its visual location. 

Upright Vision, Stratton was not 
Primarily interested in behavioral ad- 
Justments nor in proprioceptive or in- 
tersensory alterations. He wanted to 
find out whether the usual (inverted) 
Orientation of the retinal image is nec- 
essary for seeing things as upright. 
If so, Stratton (1896) said, “it is cer- 
tainly difficult to understand how the 
Scene as a whole could even tempo- 
тагу have appeared upright when the 
retinal image was not inverted [р. 
616]." Yet, he claimed, this was pre- 
cisely what happened. After several 

? Since Stratton's lens system rotated the 
retinal image through 180°, he actually 
Sdapted both to inversion and to reversal. 

Ohler’s (1951, 1964) experiments, using 
mirrors that inverted the retinal image with- 
Out reversing it, generally support Stratton's 
Observations. 


days of adaptation, the world seen 
through inverting lenses sometimes ap- 
peared to be "in normal position" 
(1896, p. 616), “right side up" (1897, 
pp. 358, 469), "rather upright than 
inverted" (1897, p. 354). Some psy- 
chologists have taken these statements 
as conclusive evidence of a change in 
Stratton's visual system. Others have 
maintained that Stratton's assertions 
mean nothing at all. Walls (1951), 
for example, insisted that Stratton's 
descriptions of the scene as "upright" 
were "entirely metaphorical" (p. 191) 
and that actually all that Stratton 
achieved was a harmony between cur- 
rent perceptions and inverted eidetic 
imagery of objects outside the field 
of view (p. 200). 

Stratton himself, on the other hand, 
thought it was quite natural for things 
to come to look upright again since 
he believed that “harmony between 
touch and sight, . . . in the final anal- 
ysis, is the real meaning of upright 
vision [1897, p. 475].” But although 
“harmony between touch and sight" 
might indeed make the perceived ori- 
entation of the body and of the seen 
world agree, both body and world 
might still be perceived as inverted 
rather than upright. Perceived up- 
rightness must depend on some other 
factor. That factor, as many investi- 
gators have pointed out, is the sensa- 
tions of pressure and tension in the 
feet, legs, and body produced by the 
pull of gravity. Recently, experiments 
on subjects with labyrinthine defects 
led Clark and Graybiel (1963) to sug- 
gest that such pressure cues, rather 
than labyrinthine cues, may in fact be 
the major determinants of the per- 
ceived direction of gravity. Under 
zero-gravity conditions, for instance, 
subjects perceive the direction of the 
surface that their feet are touching as 
downward (Simons, 1959). 

When Stratton first put on invert- 
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ing lenses, he felt gravity pulling quay 
from the seen location of the floor; 
“the general feeling was that the seen 
room was upside down; the body of 
the observer . . . was felt as stand- 
ard and as having an upright position 
[1897, p. 348].” But gradually he 
began to feel that his feet, then his 
legs and arms, then most of his body 
were all in "the place where the new 
visual perception reported them to be 
[1896, p. 615].” The new proprio- 
ceptive localization was not stable— 
sometimes he even seemed to feel his 
limbs in both the normal and the new 
locations at once (1897, pp. 345-346, 
465)—but when his legs and body 
were clearly felt to be in the new place, 
so, of necessity, were the gravitational 
pulls. Because the direction of the 
pull of gravity is, by definition, down, 
objects seen to lie in the same direction 
from the head as the legs were felt 
to be were perceived as down. So the 
floor looked “down,” making the room 
look “right side up.” 

Since Stratton’s head and shoulders 
“kept the old localization they had had 
in normal vision,” he should then have 
felt that his legs and body were not 
on the same side of his eyes as his 
chin and shoulders. In other words, 
his head should have felt inverted! 
This is evidently just what happened: 


Outer objects . . . frequently seemed to be 
in normal position, and whatever there was 
of abnormality seemed to lie in myself, as 
if head and shoulders were inverted and I 
were viewing objects from that position, as 


boys sometimes do from between their legs 
[1896, р. 616].10 


Stratton’s simile conveys exactly the 
sense in which things seen through 
inverting lenses looked upright. They 
did not look the same as they did be- 
fore the goggles were put on. Rather, 

10 “АЕ other times,” Stratton (1896) 


noted, “the inversion seemed confined to the 
face or eyes alone [p. 616].” 
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they looked upright relative to A 
felt direction of gravity, the way 
things look when seen from — 
the legs. If you set a book uprigh 
on the floor and look at it from be- 
tween your legs, you will see the po 
tom of the book as "down" and Ls 
top as "up." You will see the painter 
part of a capital A above the E 
part. In this sense, everything ae 
look upright. And yet, you will мн 
trouble reading the book—the let 


wil look rather like upside-down 
print. 


Kohler's Experiments 


Kohler's accounts. (1951, 1300) a 
adaptation to inversion help clari y io 
role of gravitational pulls on uro 
ceptively adapted body parts. ill saw 
a partly adapted subject, who sti fa 
the world as inverted, took hold е е 
string with a weight attached to à 
other end, he suddenly saw the en ре 
as hanging from the string instea The 
floating upward like a ape an 
explanation may be that the hands p 
arms are often the first parts © So, 
body to adapt (Taylor, 1962). ah 
when the weight pulled on pon ntion 
ject's arm and attracted his atte еге 
to it, he felt it pulling toward W er- 
he saw the floor and therefore * 
ceived that direction as down. 1960, 

Several writers (e.g, Klem, avita- 
р. 103) have assumed that m to 
tional cues provide a direct асс. 
reality—a veridical standard * 
visual perception, when sho 
error of its ways, conforms. on 
ing to the present interpret v 
ever, gravitational cues will : on 
inverted scene look upright e 
they are felt by some pep. 
adapted body part. Promi ay (pro- 
tational cues in an wmadapted з 
duced, for instance, bya Tm 
ing from the subject's € 
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make the scene look even more clearly 


inverted. 
Б" ме assume that the adaptive 

he ge is in the felt direction of grav- 
бє їп the visual system, we can 
"we sense of Kohler's (1964) report 
us ода subject, who had been wear- 
: hn erting goggles for several days, 
aid that "two adjacent heads, one 
upright, the other inverted, were both 
perceived as upright [p. 32]." Ap- 
oe one head (the physically 
vm оле) looked upright in that its 
Ges vas seen below its forehead ; the 
sna] | (normally oriented on the 
ТЄ | looked more natural, more 

cognizable as a normal face. 


Illusory Movements of the Visual Field 


Nae when you move your 
field ema objects enter your 
i ы view irom below and travel 
eld top. You perceive the external 
head d stationary. If you move your 
iig ownward while wearing invert- 
md ieri though, objects enter the 
Fi üeld from the top and travel 
ай уйга. Аз a result, the world 
wa to be moving rapidly down- 
у (With reversing goggles, side- 
[dm movements produce a similar il- 
нану After а few days the illusory 
the ging diminishes, until eventually 
BO appears stationary during 
ta movements (Kohler, 1964; Strat- 
n, 1897; Taylor, 1962). 

ny sort of adaptation may also be 
кез е closely related to proprioceptive 
han adaptation than to purely intra- 
a phenomena. Stratton (1897, p. 
) noted that he saw the world as 
ty only when he misperceived 
te direction in which he was moving: 
Movements of the head or of the 
body ... seemed to be toward that 
Side on which objects entered the vis- 
ual field, and not toward the opposite 
Side,” as they had felt when he first 
Put on inverting lenses. This sounds 
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like a kinesthetic change, which would 
stabilize visual perception without any 
change in the neural mechanism that 
normally takes head movements into 
account to yield a stationary visual 
world; only the felt movement of the 
head, the input to this mechanism, 
changes. 


ADAPTATION TO REVERSED RETINAL 
IMAGES 


Kohler (1953, 1964) has described 
in detail how subjects who wear right- 
angle prisms, which reverse their reti- 
nal images right for left, eventually 
achieve normal behavior and what he 
calls “correct seeing” (1964, p. 140). 
At first reading, his account is as be- 
wildering as Stratton’s. 


“Piecemeal” Adaptation 


When a person puts on reversing 
prisms, Kohler says, he initially 
reaches in the wrong direction for 
things, makes wrong turns, and sees 
all writing as mirror writing. In at- 
tempting to cope with reversed vision, 
the subject tries out various tactics, 
such as deliberately heading left when 
his goal appears to be on his right. 
As the subject adapts behaviorally, 
during the course of several weeks, he 
becomes able to walk, reach, and turn 
correctly without resorting to such 
“tricks.” Concurrently, Kohler claims, 
his visual perception changes in a pe- 
culiar piecemeal fashion: Some parts 
of the visual field are perceived cor- 
rectly while other parts remain re- 


versed. For example, after 18 days: 
Inscriptions on buildings, or advertisements, 
were still seen in mirror writing, but the 
ontaining them were seen in the cor- 
rect location. Vehicles driving on the “right” 

. carried license numbers in mirror writ- 
ing .. - the subject is capable of localizing 
both sides of, say, а *3" correctly (open to 
the left, the curves to the right) and still see 
it mirrorwise [1964, p. 155]! 


objects с 
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At this stage, even though the os 
ject's spontaneous behavior is usua у 
correct, he often becomes confused с 
makes “errors” when asked to attend 
to his “immediate visual experience. 

After many weeks, Kohler says, ea 
subject's behavior and vision are both 
reoriented. He achieves almost com- 
pletely correct impressions, even where 
letters and numbers were involved 
[1964, p. 160]." І 

When опе thinks of adaptation as 
a change in visual perception, these 
observations are incomprehensible. 
How can vision ever be partly right 
way round, partly reversed? 


Determinants of Judgments 


In attempting to make sense of 
Kohler’s puzzling observations, it is 
helpful to bear in mind four different 
determinants of what a subject says 
when the experimenter tries to find 
out whether he sees things right way 
round. The first two determinants 
are distinguishable aspects of percep- 
tion, whereas the other two are essen- 
tially irrelevant to spatial perception. 
Doubtless, people differ in which of 
these factors enter into their judg- 
ments in a given situation, and a given 
person may judge differently at differ- 
ent times. But it is often possible to 
find out operationally which factors 
the subject's report is based on and 
to design experiments that avoid the 
ambiguities of previous reports, 

1. Directional 


perception. When 
asked "Does that Object appear to be 


on your right or on your left?,” many 
people probably make a directional 
judgment relative to their dominant 
hand. If the object is seen to lie on 
the same side of the body as the right 
hand, the subject Says: "It's on my 
right" or "on my right-hand side.” 
The same kind of judgment can be 
elicited whether or not the words right 


and left are used, whether or not the 
subject has a dominant hand, and 
whether or not he refers the judgment 
to his hand: The experimenter can 
simply touch any spot on the subject 5 
body and ask whether an object ар" 
pears to be on the same side of his 
body as the touched spot. 

Such a judgment is based on one 
aspect of spatial perception—percep- 
tion of the location of an object rela- 
tive to some part of the body. ‘This 
is the sort of perception that ped 
guides motor behavior such as reac " 
ing for an object or walking toward E 
(cf. the concept of “manipulable re 
gions," Kohler, 1964, p. 163). iè 

2. Pictorial perception. Most of t Мі 
debate on adaptation to distorted vi 
sion has concerned this aspect of ea 
ual perception, though it has not e _ 
clearly differentiated from other е 
terminants of subjects’ ue. m 
Pictorial perception consists of vo 
ing at" the "picture" received by 8 
visual system (cf, Gibson’s, 1950, ae 
cept of “the visual field”). It is most 
obvious in successive comparisons, 
For example, we can ask a par 
whether an arrow he is looking at an 
pointing the same way as the og й 
tive in a painting he saw before б 
experiment began, ses of 

The perception of "clockwise e 
“counterclockwise” motion and Qe) 
or "west" on a map are probably = я 
torial perceptions for most e 
based on purely visual memo sia 
Thus, to test for changes np 
perception, we can keep a subject > the 
seeing any clocks or maps during er 
experiment and then ask him wur " 
something appears to be moving С ther 
Wise or counterclockwise, or whe na 
it is on the same side as the 9 © Аз 
clock or the east coast on a Ja ее 
long as the subject has a visual 1 not 
of a clock or a map, he neeg, an 
even think about the labels "right 
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“left,” nor about any part of his body, 
when making his judgment. (Occa- 
sionally a subject may make a direc- 
tional judgment when asked to make 
a pictorial one—for example, if he re- 
members that the locomotive in the 
painting was "going toward my right" 
—that is, toward his right hand. But 
Such exceptions have no theoretical 
importance once recognized for what 
they are.) Even when he uses the 
um right. and left, which are often 
ae ce of directional judgments, 
ad - ject may be making a pictorial 
ned pe Some people. habitually 
feld сЕ certain part ої the visual 
Bid as "the lower left corner" with- 
bo referring at all to any part of the 
ody. 
К к йди, A subject often de- 
сс ais first perceptions through 
EAN goggles as А strange,” “un- 
е, unfamiliar, new,” or "mir- 
m imaged. Later he describes them 
а normal," "natural," “all right," 
usual,” “familiar,” "right way round" 
^as Kohler, 1964, p. 142). Such de- 
n ptions are based almost entirely 
dm. experience with particular 
бе ү» or classes of stimuli and сап 
ibas hanged through. repeated visual 
n ervation, even without distorting 
C ectacles, For example, a person 
A an apprentice typesetter) 
ien practices reading mirror writing 
Danie Soon say that it is beginning to 
Jk "natural" or “all right." But 
neither his directional nor his pictorial 
Perception has changed. If asked to 
judge the location of part of a letter 
(relative to part of the body or to 
visual memories), a person gives the 
Same answer whether the letter looks 
familiar” or “strange.” 
. 4. Labels. It is risky to let a sub- 
Ject use the words right and left. 
First, the same word may be used 
to label two quite different sorts of 
Perception, pictorial and directional. 


Second, a subject wearing reversing 
prisms could decide to start calling 
everything right that he formerly 
called left, even if he had not adapted 
at all. And third, he may be incon- 
sistent or hesitant about which word 
to use even when his perception is 
completely determinate, stable, and 
clearcut; as Kohler (1964) put it, 
“there are people who always have 
trouble when asked to iell quickly 
where right or left is, but who never 
have difficulty in reaching for seen 
objects [p. 153]." Labels like right 
and leit do not affect perception; it is 
irrelevant that the subject has learned 
to call a certain direction left and an- 
other direction right." 


Proprioceptive Changes 


With these distinctions in mind, it 
is possible to reexamine Kohler's ob- 
servations and conclude that adapta- 
tion to reversed vision can be ascribed 
to a radical change in the felt location 
of the arms, legs, and body relative 
to the head and eyes, without any 
change in pictorial visual perception. 

Kohler (1953, p. 110; cf. 1964, p. 
153), in fact, did observe some pro- 
prioceptive and kinesthetic changes 
during the course of adaptation to re- 
versal, After several days of wearing 


the goggles, he reported, there was: 

. a weakening of the right-left orientation 
of the body image, which becomes uncertain, 
especially jn connection with movements that 
have been deliberately practiced in reverse. 
The subject may even turn left, with full 


11 Terms like "upright" and “inverted” are 
even more ambiguous. Saying that an object 
looks upright may mean that it appears to be 
in its usual position, that it looks the same 
as it did before inverting goggles were put 
on, that it looks the same as it feels, that its 
top is perceived to be pointing away from 
the direction of gravitational pulls, that its 
bottom appears to be near where one’s feet 
are, or that it is oriented appropriately to the 
rest of the visual scene. 
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B ^" with 
nce, when he does a "right face will 
his p closed. When he moves his — 
and hands, the kinesthetic Le шо 
movement-sensations аге peer A = = 
cord with the (reversed) visual a ке 
ultimately this leads to a dead end 
errors that cancel each other!).1? 


In a footnote, Kohler added: 


. by touch, doors, for example, seem $ 
open in a reversed direction (as — а 
with earlier), as if they had been porn ed 
around in the meantime. pepe "bas E 
experimental "right-left of the p ir ge 
remains unchanged in the shoulder an upp oa 
arm region, and from there it undertal es its 
new conquest. When the attention is Fon 
centrated on this region, there is almost never 
апу error.i? 


Clearly, Kohler regarded these kin- 
esthetic and proprioceptive changes as 
temporary aberrations of no theoreti- 
cal importance, leading only to a “dead 
end." Since proprioception and vision 
were both reversed, Kohler thought 
that both must proceed to a further, 
"correct" stage before adaptation 
could be complete, Stratton's reports, 


12 “Aber auch umgekehrt schwächt sich 
das Rechts-Links der Kórperfühlsphüre und 
wird unsicher, besonders im Zusammenhang 
mit jenen Bewegungen, die man absichtlich 
verkehrt eingeiibt hat. Die Vp. kann mit 
voller Evidens sogar bei geschlossenen Augen 
‘rechts ит” machen und dreht sich dabei in 
Wirklichkeit nach links. Sie macht Kopf- 
wendungen und Handbewegungen, 
kinästhetische Lage- und Bewegungsempfin- 
dung ganz mit der (verdrehten) visuellen 
Welt übereinstimmt. Was dabei letzten 
Endes herauskommt, führt aber in eine 
‘Sackgasse (swei Fehler, die sich gegenseitig 
aufheben!) ” 

13 “Das führt so weit 
z. B. Türen (ge 
aufzugehen schein 


deren 


, dass sogar im Tasten 
genüber früher) verkehrt 
en, als wären sie inzwischen 
versetzt worden. Worauf sich das vorexperi- 
mentelle ‘Rechts-Links’ im Küórpergefühl 
aber versteift und von wo es dann seinen 
neuen Vorstoss unternimmt, ist die Schulter- 
und Oberarmpartie. 


Wenn man darauf die 

Aufmerksamkeit konzentriert, gibt es kaum 
jemals eine Verwechslung” 

(This footnote refers to th 


rek e sentence that 
ends with “übereinstimmt.”) 
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however, suggest that proprioceptive 
and kinesthetic changes, far from 
being temporary and trivial, parems 
more and more extensive as айар 
progresses and are directly ы бе. 
for the “correct” perceptual judgmen 
that ultimately emerge. 2 

In order to clarify this interpreta- 
tion of adaptation to reversal, let p» 
consider a hypothetical experiment 1 
which we test the subject's aud 
by having him look at a Lee ip 
bearing an L on his left and an R n 
his right (Figure 1A). ester 
after putting on reversing sae 
the subject says that he feels that h ) 
right hand (the one he writes wit «d 
and the right side of his body are s 
the same end of the blackboar [Р 
namely (since he is looking at ad 
through reversing prisms), the B) 
with a backwards L on it (Figure 1B). 
But when he holds up his writing yn 
and looks at it, he sees it nearer to th 
backwards К. e 

Now he starts adapting. If the aed 
Prioceptive-change hypothesis is Send 
rect, there is a change in the felt wr 
tions of his hands relative to his pes 
That is, his writing hand not ved 
looks as if it is nearer to the ges 
wards R, it now feels nearer as tesi 
(Figure 1C). Thus the xar ER 
his right hand to be п {һе (phy 
cally) left side of his bo у. й 
Suppose we ask the subject to por 
right. He most likely peccant 
mean toward the hand he bic раен 
Accordingly, he turns toward Nene 
he feels that hand to be, and t sub. 
perimenter writes down that the turn 
ject turned left when told to vitis 
right. The error is not due y^ hi 
certainty or “weakening of the Г E 
left orientation of the body cy 
if, instead of asking the oe Gan 
turn right, we touched his rig £^ he 
and asked him to turn toward if we 
would make the same error. 
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B. Before 
adaptation 


A. Actual 
orrangement 


D. Extensive 
adaptation 


C. Partial 
adaptation 


Fic. 1. A subject’s perceptions during the course of adaptation to re- 


versed vision, according to the proprioceptive-change hypothesis. 
{ме perception of the letters is visual; percep 
ody is proprioceptive. A: The actual physical arrangement. 


ject's perceptions when he first puts on reversing goggles. 
diate stage of adaptation, with only his arms 


D: The subject’s perceptions at an advanced stage of adaptation.) 


ject’s perceptions at an interme 
adapted. 


ig his (unadapted) right shoul- 
his je tar i or if the subject focused 
umm ention there, he would turn cor- 
y to the right. 
uam we now ask the subject which 
ba bie the blackboard appears to be 
of ted right, he may fall into the state 
of ndecision that is so characteristic 
, Kohler’s partly adapted subjects. 
rd right hand feels closer to the 
feels, R, whereas his right shoulder 
whi gre to the L. Depending on 
Gale region of his body he concen- 
oe T on, he can consider either the L 
ә ca R as being on his right. Thus 
fortÉ p switch his judgment back and 
hg rom “right” to "left" without 
tio slightest change in visual percep- 
i oa Moreover, Stratton's reports 1m- 
во ate that the partly adapted subject 
oe feels his hand to be in two 
ti erent places at once, with fluctua- 
ae in which of the localizations 
ems most "real" So even if the 
subject keeps judging relative to his 
and, his judgments may Waver. And, 
9f course, he may switch back and 
orth from directional to pictorial judg- 
ments, 


(In all 


tion of the subject's head and 
B: The sub- 
C: The sub- 


With further adaptation, according 
to the proprioceptive-change hypothe- 
sis, the felt locations of the subject's 
legs, torso, and perhaps even his shoul- 
ders and most of his head, change. 
He again feels, as he felt before the 
experiment started, that the (physi- 
cally) right half of his body is near 
the hand he writes with. Now he 
can reach accurately for objects seen 
through the reversing goggles, turn 
correctly, and correctly judge the di- 
rections of objects relative to his body. 


Pictorial Perception 

But still his pictorial visual percep- 
tion remains unchanged. The letters 
on the blackboard and the license plates 
on cars look just the same as they did 
when he first put on reversing goggles. 
The only difference is that he now 


feels the rig! 
on the same si 
backwards R 
1D), so he says 
right.” 

Why, then, 
come to look “п 
ing goggles? Because, 


de as the curve of the 
that he sees (Figure 
the curve is "on my 


does writing eventually 


ormal" through revers- 
with practice, 


436 


mirror writing becomes familiar and 
easy to read, whether the letters are 
actually printed in reverse or simply 
look reversed because one is wearing 
special goggles. Indeed, Kohler's 
(1964, p. 160) subject reported that 
“the first words to rectify themselves 
were the common ones,” which were 
seen most often through the reversing 
goggles. But it is a mistake to 
conclude that pictorial perception re- 
versed and “mirrorwise seeing” (“spic- 
gelbildliche Sehen”—Kohler, 1953, p. 
113) became established. We would 
not say that about someone who learned 
to read mirror writing without wear- 
ing reversing spectacles. 

Given this interpretation of adapta- 
tion, it is not surprising to hear that 
one stubborn subject (Taylor, 1962, 
p. 180) "achieved satisfactory behay- 
ioral adjustment" but “denied that he 
ever perceived the world the right 
way round through his spectacles," 
even after 71 days of Wearing them. 
When questioned closel 
said that all of his 
ments were made 
his right tem 


closer to the Part of the visual field 


Stratton’s reports 
er the felt location 


ples would 
the subject 
tell the expe 
Ing a pictorial 


left. In either i 
. case, this subject’ 
ceptions—visual and 41а 


according to Taylo 

" т, 
managed to “perceive the Word whe 
right way round,” 


Aftereffects 


The “peculiar experiences” 


Kol 
1964, p. 158) that the <. A 


ted sub- 
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ject encounters when he takes off the 
experimental spectacles are just what 
one would expect if the subject has 
undergone Proprioceptive changes, has 
become accustomed to the reversed 
appearance of particular stimuli, but 
has experienced no change in pictorial 
perception. 

When Kohler (1964, p. 158) re- 
moved his reversing goggles, after 
weeks of adaptation, and looked at а 
Picture which he had seen before but 
not during the experiment, the picture 
immediately looked familiar. The per- 
son in it appeared (pictorially) to be 
Tunning, as before, from left to right. 
Nevertheless, the person was seen as 
Tunning toward the left edge of the 
page; that is (as Kohler makes 
clear), toward where Kohler felt his 
(adapted) left shoulder to be. 

Kohler (1964, р. 160) does report 
that another subject, the one who 
“achieved almost completely correct 
Impressions" while wearing reversing 
Soggles, saw the whole room mirror- 
Wise when the spectacles were Te- 
moved. But the evidence for this 
Statement is that the subject read p's 
as q’s, b's as d's, and 10:30 on a clock 
as 1:30—which is just what woul 
happen, without any perceptual change 
if one read nothing but mirror writing 


and saw nothing but backwards clocks 
for 37 days, 


LIMITATIONS or THE PROPRIOCEPTIVE- 
CHANGE HYPOTHESIS 


Although changes in the position 
sense may underlie most of the phe- 
nomena of adaptation to optical dis- 
tortions, there are some kinds of adap- 
tation that cannot be so interpreted. 

or example, adaptation to the chro- 
matic dispersion produced by prism 
(“color fringes” —Kohler, 1964) seems 
to depend on changes in “contour CC 
tectors” within the visual syste™ 


(Hay, Pick, & Rosser, 1963; McCol- 
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lough, 1965a). Adaptation to bicolored 
Spectacles (Kohler, 1964) has been 
shown to depend on retinal color adap- 
tation and simultaneous contrast (Mc- 
Collough, 1965b). There appear to 
be .Several forms of adaptation to 
tilting, | curvature, and other optical 
distortions (see, e.g, M. M. Cohen, 
a Held & Rekosh, 1963; Kohler, 
964; Mikaelian & Held, 1964; Mo- 
à & Harris, 1965; Ohwaki, 1961). 
к of these appear to be purely 
i e changes; others may be based 
: changed registration of head or 
eye positions and movements. 
eae adaptation situations probably 
TI lude some  motor-skill learning. 
"IS component may sometimes be 
even larger than that due to proprio- 
е changes (as, perhaps, in Smith 
& Smith's, 1962, studies) or it may 
E much smaller but still considerable. 
or example, Hall (1964) and Harris 
(unpublished data) both found that 
"d en the arm movement used during 
aptation differed grossly from that 
Used in the tests, there was some dec- 
TOME in the measured adaptive shift. 
Наз data have been published by 
Teedman, Hall, & Rekosh, 1965.) 
Р... ѕоте саѕеѕ, а proprioceptive- 
ion interpretation | requires addi- 
ae assumptions similar to those 
М аде by other theories. For instance, 
© increased variability of responses 


t P 
Te follows watching one’s hand 
rough a variable prism whose 


amount of displacement keeps chang- 
Ing (Cohen & Held, 1960; summarized 
by Held & Freedman, 1963) could be 
due to increased uncertainty about 
arm position. Many of the “condi- 
Чопа] aftereffects,” which others а5- 
Cribe to the conditioning of visual per- 
Ceptions to nonvisual cues (Kohler, 
1964 ; Taylor, 1962), can be attributed 
to the conditioning of altered position 
Sense to these same cues. 

. Recently, several experimental find- 
ings have been cited as directly ruling 
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out a proprioceptive change in adapta- 
tion to displacement. Efstathiou and 
Held (1964) reported that adapting 
one hand to displacing prisms had no 
effect on blindfolded subjects’ reaching 
for the remembered locations of targets 
they had previously felt. They also 
found that the measured adaptive shift 
was smaller when the unexposed hand 
served as test target than when the 
target was a visually perceived object. 
Bauer and Efstathiou (1965) found 
an adaptive shift in pointing "straight 
ahead" only if subjects were first 
tested on pointing at visual targets; 
if tested on straight ahead first, the 
shift was sizable, but in the wrong 
direction. And H. B. Cohen (1963) 
reported that adaptation with the tar- 
get in the retinal periphery does not 
transfer completely to test targets on 
the fovea. It is difficult to assess these 
findings since each is contradicted, di- 
rectly or indirectly, by other findings 
(e.g.. Goldstein ^; Hamilton & Hill- 
yard 1; Harris, 1963a). The reasons 
for the empirical disagreements have 
not been satisfactorily worked out. 
In the reports of Stratton (1897), 
Kohler (1964), and Taylor (1962), 
several passages seem to describe per- 
ceptual changes that cannot be attrib- 
uted to altered position sense. Fur- 
ther work is needed to determine 
whether these statements are based on 
confusions about the various determi- 
nants of subjects’ verbal reports or 
result from some complicated altera- 
tions in position sense or do in fact rep- 
resent other sorts of adaptive change. 


OTHER THEORIES OF ADAPTATION 


Stratton 
After reading Stratton’s striking de- 


scriptions of the proprioceptive changes 


14 Donald Goldstein, personal communica- 


tion, June 1964, May 1965. 
15 Charles R. Hamilton and S. A. Hillyard, 


personal communication, July 1964, August 
1964. 
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he underwent, one is surprised to find 
him saying in his theoretical discus- 
sion that “the tactual perceptions, 2- 
such, never changed their place, am 
"the restoration of harmony perum 
the perceptions of sight and those i 
touch was in no wise a process o 
changing the absolute position of = 
tual objects so as to make it identica 
with the place of the visual objects 
[1897, p. 476]." He seems to be 
ignoring his own introspections when 
he claims that there is neither a change 
in proprioceptive localization nor a 
change in visual localization, but only 
a change in the relationship between 
the two. This noncommittal idea has 
proven quite appealing to many pres- 
ent-day psychologists. 2 
Clearly a change either in vision or 
in the position sense would result in 
a changed relationship between the 
two. But saying that only the rela- 
tionship between the two modalities 
changes is ignoring information about 


the changes within one (and only one) 
of the modalities, 


Stratton’s own re 
that the first ste 
inverted vision 
feet were in a 


ports make it clear 
p in his adapting to 
Was to feel that his 
new location relative 
to the rest of his body—a change 
Within the position sense. Gradually, 
the felt locations of more and more 
of his body swung into line with that 
Of his feet, that is, into line with 
the inverted visual scene. True, the 
final result was a new relationship be- 
tween the position Sense and visual 
perception, but this new relationship 
was brought about entirely by changes 
in the position sense, with no changes 
in vision. 

In 1897, Stratton 
theorized that adaptation is the at- 
tachment of new visual imagery to 


tactual sensations and, concurrently, 
the attachment of new {а 


ctual imagery 
to visual sensations. If we ignore the 


(рр. 472-475) 
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second half of this formulation and 
just postulate that adaptation copan 
of associating new visual imagery 0 
parts of the body with таркар 50 
stimuli from those parts, we can dea 
with much of the relevant data, pro- 
vided we make one additional та 
tion: that the felt position of a limb 
is not directly connected with eet 
ceptive stimuli, but is a byproduct е 
where the limb is mentally pictured. 
The visual imagery notion, then, 
would make the same predictions as 
the proprioceptive-change писан 
but requires an extra step—a step ла 

some subjects’ introspections deny. 


Taylor 


Taylor (1962, 1964) attempts i Ea 
plain all perception, from depth perc r 
tion to color vision, with a single hyp to 
thetico-deductive theory. Record op E 
this theory, visual perception of an п” 
ject is determined by the “авран 
of stimulus-response engrams—neu Пу 
traces of the responses uec apti 
walking, reaching, and verbal d to 
sponses) that have been conditione be 
similar stimuli. Taylor apparently eni 
lieves that adaptation to oper 
inversion, or reversal of the б 
images leads to changes in ace 
perception. and perhaps to ра 
changes as well (1964, р. 73). tual 
ever, he thinks that these р у 
changes, though genuine, are la y 

1 

10 1п a later paper, Stratton (n dsl 
State, contradicting his earlier d o 
Views, that "the place in which any m the 
the body is persistently seen Vara 
localisation of the dermal and kindre to see 
tions arising in that part. If one were differ- 
his feet, for instance, in some irec 
ent from their present visual posi ir kin- 
would in the end refer thither ша 
aesthetic impressions also [р. 4631; otably 
Only a few subsequent ENSE о) have 
Walls, 1951; Smith & Smith, 1 ter inter- 
Paid much attention to Stratton's la detailed 
Dretation or even to his original es. 
descriptions of proprioceptive chang 
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the result of changes in verbal labeling 
(1962, pp. 179-181, 185). He expects 
adaptation to progress in piecemeal 
fashion with visual perception becom- 
ing more and more veridical as the 
Subject acquires a larger number of 
appropriate responses (1962, pp. 188, 
197, 207). 

Beyond this, it is difficult to derive 
unequivocal predictions. For example, 
Taylor's theory could predict either 
Way about transfer to most of the tasks 
in Table 1, depending on what assump- 
tions are made about steepness of gen- 
eralization gradients, breadth of “equiv- 
alence classes,” degree of “interpene- 
tration” of sensory modalities, and rela- 
tive importance of motor behavior and 
Implicit verbal responses. 


Kohler 


, Kohler's (1964) studies of adapta- 
tion to a wide variety of visual distor- 
tions have provided the inspiration, di- 
rectly or indirectly, for much of the 
research in this field. He has concen- 
trated on setting down his observations 
rather than on providing a detailed 
theory, It is clear that he agrees with 
Taylor that adaptation involves changes 
in directional perception, based on the 
acquisition of new behavioral responses 
to transformed retinal images (see, e.g., 
Pp. 163-164). With prolonged ex- 
Posure to reversing spectacles, Kohler 
Says, there are eventually pictorial 
changes, with more and more stimuli 
seen correctly” (pp. 140, 163-164). 
Unlike Taylor, however, Kohler thinks 
that verbal labeling is of no great sig- 
nificance in adaptation. 

Although Kohler did mention (in a 
footnote) that “alterations in kines- 
thetic sensitivity" may be “of crucial 
importance" (p. 32) in adapting to dis- 
Placement, he did not make much use of 
these alterations in explaining other as- 
Pects of adaptation. In fact, in his dis- 
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cussions of reversed vision, he regards 
such alterations as transitory—normal 
proprioception and kinesthesis are soon 
reinstated, and form the basis for the 
"correct" visual perception that ulti- 
mately emerges. In his theoretical dis- 
cussions, Kohler did not attempt to ex- 
plain the simpler phenomena of adapta- 
tion to displacement, inversion, and 
reversal of retinal images, but rather 
dealt with the complex "situational 
aftereffects.” 


Held 

In an extensive series of carefully 
controlled experiments, Held and his 
co-workers have demonstrated the im- 
portance of active movement and move- 
ment-produced visual feedback (“re- 
afference") in producing adaptation. 
These experiments set the pattern for 
most of the recent work in the area: 
brief adaptation periods with quantita- 
tive before-after measurements. 

Held has been primarily concerned 
with the necessary preconditions for 
adaptation; he has said little about 
the nature of the adaptive change (see, 
eg. Held, 1961). It is not clear 
whether Held believes that adaptation 
involves any perceptual changes, visual 
or proprioceptive. For instance, Held 
and Freedman (1963) say that adapta- 
tion "represents a change in state of the 
relevant sensorimotor control system, 
such that [after complete adaptation] 
the input-output or stimulus-response 
relation becomes identical to that which 
existed prior to rearrangement [р. 
457]." Recently Efstathiou and Held 
(1964) proposed a tentative theory of 
arm adaptation to displacement, accord- 
ing to which “the change responsible 
for the shifts occurs in a representation, 
within the nervous system, of the spa- 
tial relation between the exposed arm 
and directions that are defined inde- 
pendently of that arm.” Further elab- 
oration of this model is necessary to 
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determine how it differs from the pro- 
prioceptive-change interpretation. 


Smith and Smith 


Smith and Smith ( 1962) claim. that 
adaptation consists of acquiring E d 
specific perceptual-motor skills. They 
seem to deny that there is any general 
reorientation of perception, whether di- 
rectional or pictorial (1962, pp. 83, 
311). Their research and theory sup- 
plement the Proprioceptive-change in- 
terpretation by dealing with situations 
in which proprioceptive changes prob- 
ably are minimized because there is a 
large spatial separation between the 
felt location of the hand and its tele- 
vised visual feedback. The acquisition 
of highly specific perceptual-motor 
skills is facilitated by Smith and 

Smith's tasks, which permit continuous 
visual feedback and stress Speed of 
execution. Indeed, their usual measure 

(speed of performance) is sensitive 
only to the development of highly prac- 
ticed motor skills. However, it is pos- 

Sible that adaptation of Smith and 
Smith's three movement systems—loco- 
motion, transport, and manipulation— 
may in part represent, respectively, 
proprioceptive head or eye adaptation, 


arm adaptation, and acquisition of 
manipulatory skills. 


Werner and Wapner 


Werner and Wapner (1955) have 
discussed some of Kohler's findings in 
terms of their org 


anismic sensory-tonic 
field theory, attributing adaptation to 
changes in the subject’s “organismic 
state — (sensory-tonic distribution) ,” 
Basically, though, their account simply 
restates Kohler's observations. Some 
of their statements would match the 
present author's if the words “felt posi- 
tion" of certain body parts were substi- 
tuted for such abstract terms as “ог. 
ganismic state” or “equilibrial axis.” 
But Werner and Wapner consider the 
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organismic state to be only one part of 
the process that determines body ae 
ception, not the perception itse . 
Могеоуег, {һе organismic state is а5- 
sumed to include “not only postural, 
but emotive, motivational factors, etc. 
[p. 133],” which are clearly beyond the 
scope of the present formulation. 


PRECONDITIONS AND MECHANISM 
FOR ADAPTATION 

Visual Proprioceptive Discrepancy 

Although the proprioceptive-change 
interpretation of adaptation does not 
Specify any particular process or pre" 
condition for the change, it is tempting 
to assume that adaptation results шош 
a discrepancy between proprioceptive 
and visual information. One effective 
way to produce such a discrepan 5 
to look at some part of one’s bo | 
through distorting goggles, but it is 1 
the only way. For instance. when 
subject walks while wearing ees 
prisms, his position sense may шка 
that his head is turned to one side e 
the direction of movement, whereas 50; 
retinal flow pattern (Gibson, 19 M 
Held & Freedman, 1963) may щил 
that the head is pointing right along 
the axis of movement. . 5 

On the other hand, there 49 had 
logical necessity that proprioceptive Р 
Puts be used at all by the metano, 
that recalibrates the position sense. "e 
could be, as Helmholtz (19625) p 
gested, that adaptation is based pus ult 
changes in the retinal image that res A 
from a given “effort of the will, side 
Held and Freedman's ( 1963) m 
nology, on motor corollary discha ae 
and visual reafference contingent "Ti 
active movement. Or the pup 
tive Change could be due to the T Tes 
down of engrams of Pando or 
Sponses, like those postulated by 
(1962). 


may 
Some proprioceptive changes 
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arise from something like "sensory 
fatigue," without any direct participa- 
ton by vision. Hein (1965) has found 
that after simply holding their heads 
turned to one side for 10 minutes, even 
with their eyes closed, subjects point 
incorrectly at visual targets. This 
„Postural after-effect,” as Hein called 
it, probably was due to a change in 
felt head orientation. Similarly, Koh- 
ler (1951, p. 18) reports that subjects 
Who wear displacing prisms tend to 
hold their eyes in an abnormal posi- 
tion that eventually comes to feel nor- 
mal once more. Such a "fatigue" effect 
Could indeed produce some of the phe- 
nomena of adaptation to displacement, 
but it can underlie neither adaptation to 
Inversion or reversal nor arm adapta- 
tion to displacement. 


Active Movement 


A number of experiments by Held 
and his colleagues have shown that ac- 
tive movements by the subject play an 
Important role in adaptation to several 
Optical distortions (Held & Freedman, 
1963). Although some recent investi- 
Sators have claimed to find extensive 
aplanon with passive exposure (e.g, 

allach et al., 1963; Weinstein, Ser- 
Sen, Fisher, & Weisinger, 1964), active 
movement does seem greatly to facili- 
tate adaptation. 

Theories that consider the end prod- 
Uct of adaptation to consist of new 
Motor responses or new visuomotor 
Correlations (e.g., Held, 1961; Smith & 
Smith, 1962; Taylor, 1962) also as- 
sume that active movement is a crucial 
Precondition for adaptation. On the 
Other hand, Hamilton (1964a) has 
listed several ways to account for the 
importance of active movement with- 
Out postulating any motoric component 
in the end product. For example, the 
Position sense during active movement 
may differ from (and be more precise 
than) that during passive movement. 
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Or motor discharges may act as a “cat- 
alvst" that permits a joint's position 
sense to change. 


IMPLICATIONS FOR PERCEPTUAL 
DEVELOPMENT 


Psychologists have traditionally 
looked to studies of adaptation to dis- 
torted vision for clues about the devel- 
opment of visual perception in the in- 
fant. The usual, empiricist assumption 
(outlined by Berkeley in 1709 in his 
New Theory of Vision; see Berkeley, 
1910) is that visual space perception is 
“secondary”: It is based on the spatial 
sensations given by touch, kinesthesis, 
and position sense. As Dewey (1898) 
put it: “Ultimately visual perception 
rests on tactual. . . . Spatial relations 
are not originally perceived by the eye, 
but are the result of the association of 
visual sensations with previous muscu- 
lar and tactual experiences [p. 165]." 

This belief in the primacy of touch 
is so ingrained that experimental re- 
sults are sometimes flagrantly misin- 
terpreted in order to support it. Carr 
(1925), for instance, concluded: "It 
is thus obvious that the Stratton ex- 
periment involves no reconstruction or 
alteration of tactual . . . space. It is 
the visual system that is disrupted and 
then reorganized so as to conform to 
touch ... [p. 141].” Stratton's, Koh- 
ler’s, and Held’s findings have been 
cited over and over as evidence that 
visual space perception is flexible and 
therefore must have been acquired 
through tactile-proprioceptive and 
motor experience. The reinterpreta- 
tion of these findings that has been pre- 
sented here suggests the opposite con- 
clusion. Vision seems to be largely 
inflexible, whereas the position sense 


is remarkably labile. 

The implication, if one dare draw 
any, is that the Berkeleyan notion 
should be turned around. It seems 
more plausible to assume that proprio- 
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ceptive perception of parts of the body 
(and therefore of the locations of 
touched objects) develops „with the 
help of innate visual perception rather 
than vice versa." A growing number 
of recent studies support the view that 
many aspects of visual perception are 
not influenced by experience and are 
largely innate (e.g, Bower, 1964; 
Fantz, 1965; Gibson & Walk, 1960; 
Hubel & Wiesel, 1963; Robinson, 
Brown, & Hayes, 1964). Further- 
more, if the position sense were innate 
—if each spot on the skin were proprio- 
ceptively "preaddressed"—the local 
sign lodged in a baby's fingertip might 
go on forever signaling that his arm is 
10 inches long. 

So, when a baby stares raptly at his 
outstretched hand, he is probably find- 
out where his hand is, not what his 
visual sensations mean. He is making 
use of an adaptive mechanism that 
keeps his position sense accurate de- 
spite extensive and uneven growth of 
his body. This mechanism enables us 
to use the precise, detailed information 
that vision provides, as a means of con- 
tinually readjusting our vaguer and 
more variable position sense. 


17 Clearly vision is not the Only basis for 
acquiring and maintaining the position sense 
Or blind people would have no idea where 
their arms and legs were. 


however, provide the quickest а: 
recalibration, 


Vision may, 
nd most exact 
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STRUCTURE OF PHENOMENAL DOMAINS 


JOHN L. RINN: 


Division of Counseling Psychology, University of California, Berkeley 


The development of theoretical systems in the social sciences has 
proceeded without sufficient attention to structural similarities between 
different universes of content. This has interfered with the discovery 
of isomorphic relationships between theories. In this paper the central 
variables of teacher-counselor personality are assigned to 3 domains 
( Personal, Interpersonal, and Attitudinal) which are analyzed in terms 
of similar dimensional structures. Facet analysis is used to compare 
domain elements and to discover empirical patterns of variables. 
Finally, the interdomain correlations are used to construct a heuristic 


typology of counselor syndromes. 


When Jimmy puts a mouse into 
teacher’s desk, many things are likely 
to happen. If I describe this event 
from my point of view and you describe 
it from yours, the two descriptions will 
certainly be different and may very well 
appear to refer to two separate events. 

mong other explanations for this oc- 
currence is the one that you and I are 
attending to different classes of phe- 
nomena, that we perceive and describe 
Events selectively. 

How then can we communicate with 
and understand each other? My pro- 
posal is that each of us should first 
identify the domain of phenomena to 
Which he is attending and then describe 
Its structure, that is, identify the major 
Parameters which can be used to rank 
the domain elements. If we will both 
do this, my hunch is that we will find 
Some degree of similarity among 
Parameters even when we are con- 
Cerned with different domains. Hope- 
fully, these common parameters will 
help us bridge the communication-re- 
Search gap which now stands in the 
Way of our constructing a broad science 
of man. It would also be fortunate if 
the normal outputs of our research 
methodologies corresponded to the 
structural characteristics of our theo- 
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retical models. That is, a regression 
methodology is appropriate for a theory 
which emphasizes the cumulative ef- 
fects of variables upon a criterion, 
while factorial methods are appropriate 
for theories which emphasize the dis- 
tance relationships among variables. 
Fiske (1963, p. 643) has recently used 
this point in his plea for closer collabo- 
ration between theorists and methodol- 
ogists, and Brodbeck (1963, pp. 88-91) 
has asked researchers to pay more at- 
tention to isomorphic relationships be- 
tween model building and measure- 
ment. 

The present paper deals with that 
broad class of personal-social events 
which are of concern to members of 
the education profession. These events 
have been analyzed in a number of 
ways, including the schemes of Bloom 
(1954), Krathwohl, Bloom, and Masia 
(1964), and Gage (1963, p. vi). My 
preference has been to assign such 
behavior to one of three domains : Per- 
sonal, Interpersonal, and Attitudinal. 

In the following sections an illus- 
tration is given of how descriptions of 
phenomena are assigned to one or 
another of the three domains, and each 
domain is conceptualized in terms of its 
hypothesized dimensional structure. 
Following Humphreys’ (1962) pro- 


446 


posals, two of the domains are ex- 
plicated by means of facet-analysis pro- 
cedures to demonstrate a desirable ra- 
tional link between theory building and 
test construction, and factor analyses of 
data from 350 counselors are employed 
to test the empirical relationships of 
domain variables. Finally, interdomain 
correlations are used to construct a 
heuristic typology of counselor syn- 
dromes. 


PHENOMENAL DOMAINS 


The process of perceiving and de- 
scribing social phenomena Tequires a 
perceiver, a stimulus object, and a 
verbal statement presumed to represent 
the cognitive outcomes of observations 
and introspections. The stimulus ob- 
ject in social perceiving may be an- 
other person, one’s “self,” or an inter- 
action between Persons ; and the verbal 
statement may refer to an observable 
behavior, an internal behavior of the 
self which is not observable by others, 
or an inferred state or attribute. Al- 
though the last type of statement is 
included somewhat reluctantly because 


ti ments as 
the following : (a) I see your clenched 
hands; (b) І feel irritable; (c) You 


In the present Scheme, Statements 
are classified in terms of the distal 
stimulus to which they refer, Thus, 
statements which refer to such non- 
interactive, sensory-motoric behaviors 
as postures, movements, and visceral 
states are said to belong to the Personal 
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Domain. In the sample of statements 
given above, the first is descriptive of 
extraorganismic behavior, the second, 
of intraorganismic behavior, and the 
third of an implied behavioral state 
in the other person. В 
Statements which refer to behavioral 
interactions or relationships between 
persons such as giving, taking, protect- 
ing, and criticizing are said to belong 
to the Interpersonal Domain. The 
fourth statement above is of this type. 
Repeated encounters between a per- 
son and his environment tend to lead to 
a set of rather stable expectations and 
evaluations of that environment. 
These cognitive-evaluative dispositions 
of the person function to stabilize his 
Perceptions of the world and others 
Perceptions of him and may be referred 
to as his expectations, sets, values, 
Toles, and the like. Statements, like 
the last mentioned above, which refer 
to such generalizations regarding social 


relationships are said to belong to the 
Attitude Domain, 


The Interpersonal Domain 


The problem of the structure of the 
interpersonal relationship has been ap- 
proached by research workers in many 
different fields, In spite of relative 
independence of purposes, traditions, 
and methodologies, the results of the 
major studies have been strikingly 
similar, particularly those which pari 
used factor-analytic methods © 
analysis, 

In a review of the investigations E 
Carter (1954), Leary (1957), pen 
gatta, Cottrell, and Mann (1958), an 
Schaefer (1959), it was concluded bY 
Foa (1961) that the common finding à 
in all studies were two substantive © s 
mensions of interpersonal behavior 
Which could be identified as the Domi 
Dance-Submission dimension and ues 
Hostility-Affection dimension. псе 
factors appeared to be related to f 
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specific methodology used, to situa- 
tional influences, or to sequences of 
actions. More recently, Adams (1964) 
has claimed that all interpersonal be- 
havior, both adaptive and maladaptive, 
i be meaningfully categorized within 
these two dimensions, and he has cited 
additional support in the factor-analytic 
studies of the MMPI performed by 
Welsh (1956) and Jackson and Mes- 
sick (1961). ` 
It was also concluded by Foa (1961) 
that the arrangement of variables in all 
Studies tended toward that patterned 
relationship known as the circumplex 
Kouman, 1954a). That is, when the 
actor loadings of the variables are 
Plotted against the two major axes, à 
Circular. order tends to appear. This 
ne was most apparent among 
he items which comprised the Inter- 
Personal Check List (ICL), the instru- 
rue devised by LaForge and Suczek 
rwn: to give operational meaning to 
e ary's (1957) interpersonal system 
ае Leary’s system is based 
= he two dimensions of behavior de- 
es above, and the circularity of the 
CL variables appears to be due to 
their having been selected rationally on 
this basis. 
Interpersonal Space. In Figure 1 
а attempt has been made to visualize 
e two major dimensions of inter- 
Personal behavior in terms of a 
two-dimensional spatial model which 
Corresponds in appearance to the two- 
factor solution of a factor-analytic 
Procedure. Figure 1 has been marked 
Out in sectors for heuristic purposes, 
that is, to lead the imagination of the 
reader easily to the circumplex model 
Which will be presented in Figures 4 
and 5. The dimension of emotion has 
been characterized by affectionate be- 
haviors in one sector of the model and 
critical behaviors in the opposite sector. 
Interpersonal power is described as 


Affectionate 


Fic. 1. Interpersonal behaviors : 
emotion and power. 


varying from dominant behaviors to 
submissive behaviors. 


The Attitude Domain 


An attitude is identified by a consis- 
tency of response to a class of objects 
(English & English, 1958, p. 50). At- 
titudes in the field of education might 
have as their objects abstract social 
values, broad educational goals, differ- 
ential curriculum purposes, course out- 
comes, instructional procedures, Or 
various kinds of person-to-person inter- 
actions such as teacher-pupil relation- 
ships. One way of categorizing these 
objects is in terms of those having to 
do with goals or outcomes and those 
having to do with procedures or 
processes. It has been suggested that a 


general predisposition exists such that 
tend to perceive social 


some persons 

situations in goal-oriented terms and 
others in process-oriented terms, and 
persons with different dispositions may 
have special communication difficulties 
(Rinn, 1961a). Attitudes toward in- 
terpersonal relationships may be ex- 
have the same dimensionality 


pected to 
as the actual descriptions of those rela- 
tionships. For the sake of keeping а 
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clear distinction between the two do- 
mains, however, the attitude dimen- 
sions will be referred to by the terms 
emotionality and control. 

Although emotionality in its pure 
form consists of one person liking or 
disliking another, emotion in an educa- 
tional setting is usually expressed as 
an aspect of the teacher’s reaction to 
task accomplishment or rule adherence. 
Thus the emotional component of a 
teacher’s attitude toward the work of 
students is likely to be referred to as 
a tendency to give praise or blame, 
while the affective reactions to class- 
room behavior are likely to be described 
as the teacher’s tendencies to reward or 
punish or to give approval or disap- 
proval. Experimental manipulation of 
this variable has been described by 
Kounin, Gump, and Ryan (1961 
comparison of threatening versus sup- 
portive desist techniques. Emotional- 
ity would also seem to have been a 
major criteria for distinguishing be- 
tween ari 4 M nonpunitive teach- 
ers in a stu by Kouni 

(1961). y by nin and Gump 

The control dimension is the major 
component of such conceptual dichoto- 
mies as traditional versus progressive, 
autocratic versus democratic (Lippitt 
& White, 1958), dominative versus in- 
m a (Anderson & Brewer, 1946), 
and conservative versus permissiz 
(Lazarsfeld & Thielens, 1958). Coun 
selors went through a long period of 


dispute concerning the issue of direc- 


tive versus nondirective counseling 


(eg., Rogers & Skinner, 1956). 
Differentiation of the two ends of 
this continuum is not a simple process 


because the definition may focus on 
different levels and obje 


Jina 


cts. At the 
broadest value level, there exist the 
contrasting doctrines of individual 


rights and the rights of society, social 
liberalism and social Conservatism, the 
inherent wisdom of the child and 


Joun L. Rinn 


the experiential wisdom of the adult, 
the goodness of man and the badness 
of man. At the level of learning the- 
ory, belief in highly structured learning 
programs contrasts with belief in op- 
portunistic development of immediate 
interests. At the behavioral or leader- 
ship level, teachers and counselors can 
give many directions or few directions, 
make many decisions or few decisions, 
and adherence to suggestions may be 
made a matter of rigidity or flexibility. 
Said another way, students may be ex- 
pected to follow orders or to take 
initiative, 

Attitude Space. As with the Inter- 
personal Domain, the two major dimen- 
sions of the Attitude Domain may be 
given spatial representation as in Fig- 
ure 2. On the emotionality dimension 
the term sociable implies both warmth 
and closeness between persons while 
the term detached is intended to ine 
clude both cool detachment and active 
antagonism. For the control dimen- 
sion the term bermissive suggests that 
the locus of responsibility of decision 
making has been delegated to the stu- 
dent and implies that the elder has 
faith or trust in the student's ability 
to benefit from taking such responsi- 
bility. The term directive, on the other 
hand, Suggests that control remains 11 
the hands of the one to whom legal 
responsibility has been assigned and 
implies that the elder is trusted tO 


exercise this control effectively ап 
benevolently, 


The Personal Domain 


What is the structure of personal 
behavior? A person thinks, feels, 
moves. The authors of the handbooks 
in the Taxonomy of Educational Ob- 
Jectives series (Bloom, 1954; каш 
Wohl et al, 1964) found that mos 
educational objectives could be place 
rather easily into one of three maj E 

omains : cognitive, affective, and PSY 
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chomoter. There were few references 
found in the literature to the psycho- 
motor domain, and the question was 
raised as to whether a human being 
ever does acting without feeling or 
thinking. > 

Schachtel (1959) has contributed an 
analysis of man’s sensory experiences 
which calls for application to the pres- 
ent gap between theories of perceptual, 
Psychoanalytic, and educational devel- 
opment. Schachtel identifies two basic 
perceptual modes which he calls the 
autocentric and the allocentric modes 
А нан н The autocentric mode 
sabi pe by the sensory experi- 
fo of pleasure or displeasure, com- 

rt or discomfort. The allocentric 
mode is characterized by objectifica- 
tion, the phenomena of one's encounter 
With more or less definite environmen- 
z objects. This encounter may vary 
rom an actively initiated exploration 
of the environment to a passive re- 
ception of environmental stimulation. 
Ginzberg (1956) has proposed a basic 
personality predisposition towards ac- 
tivity or passivity which appears to 
correspond to the allocentric dimension 
of perception. 


Sociable 


Detached 


Permissive 


nitive attitudes : emotionality 


Fic. 2. 
and control. 


Cogr 
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Pleasant 


Unpleasant 


Fic. 3. Personal modes: feeling and activity. 


Although I have no empirical evi- 
dence which bears directly upon the 
structure of the Personal Domain, I 
find that I am attracted to the notion 
that similar structures may obtain 
among all three domains. Therefore, 
I propose on a purely rational basis 
that the Personal Domain will be found 
to be structured in terms of two major 
dimensions, feeling and activity, which 
correspond to the dimensions described 
in the Interpersonal and Attitude do- 
mains, The feeling axis will vary from 
a comfortable state of the person on 
the one hand to an uncomfortable state 
on the other. The activity axis will 
vary from active exploratory behaviors 
receptive behaviors, and 


to passive 
be either motoric 


these behaviors may 
or cognitive in nature. 

Since a person responds as a whole 
organism whenever he does respond, 
any action may be accompanied by 
varying degrees of feeling tone, from 
the most positive to the most negative. 
It seems reasonable, then, to represent 
the two axes as in Figure 3, where а 
given behavior is given spatial repre- 
sentation with reference to both the 
feeling and activity dimensions. 
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Facet ANALYSIS 


The intellectual process of ro 
complex concepts into simple edes 
elements is a logical procedure ne 
is basic to all scientific endeavor. The 
term facet theory, however, has dm 
to stand for a set of notions put fort 
by Guttman (19582) „regarding the 
structural design of social and psycho- 
logical theories which utilize composite 
(complex) concepts. Guttman has de- 
fined composite concepts in terms of 
Cartesian products of the elements of 
simpler concepts called facets. . Gutt- 
man  (1958b) regards Guilford’s 
(1956) work on the structure of intel- 
lect as essentially a search for more 
refined facets for mental tests or for 
the expression of gross facets of in. 
tellect as Cartesian products, Exam- 
ples of facets in the area of mental 
ability are kind of content and level 
of complexity. 
An important postulate of facet the- 
ory is the contiguity principle which 
states that the correlation between two 
variables increases with the simil 
of their facet structures, 
has applied this Principle to the pre- 
diction of correlations among dyadic 
relationships and has demonstrated the 
feasibility of performing facet analyses 
of social telationships. Facet elements 
which Foa (1963) has recently iden- 
tified include distinctions between ob- 
Servers, actors, content of behavior, 
object of behavior, mode of behavior, 


alias (point of view), and level (actual- 
ideal). 


arity 
Foa (1958) 


Facets of the Interpersonal Domain 


In his review of studie 
ture of interpersonal 
(1961) proposed that the two axes 
of Dominance-Submission and Love- 
Hostility were sufficient for describin 
the empirical findings, but that they 
were not sufficient for explaining the 


5 of the struc- 
behavior Foa 


underlying structure which Lips 
a circumplex pattern to appear. in 
desired explanation of circularity wa 
found in the theory of principal I 
ponents of scalable attitudes peque 
by Guttman (1954b) based on Е 
integration of mathematical ana pay- 
chological concepts. In terms ot a 
theory, the circumplex pattern is wae 
marily a function of the second mei 
fourth of the first four principal com 
ponents. Foa therefore developed ig 
fourfold facet structure of the иш. 
personal act based on the content о 
е action (acceptance ог rejection) 
the intensity of the action (high z 
low), the object of the action (garo 
other), and the mode of the vim 
(social or emotional). In combining 
these elements into profiles, Foa 18- 
nored the intensity component and 
defined love and hostility as aeee 
or rejection of affect i genes 
mode), while dominance and ped 
Sion were defined as acceptance oF Mis 
jection of status (social mode). far 
addition of the object element (sel ex 
other) made eight facet profiles pos а 
ble, and these facet combinations e 
listed below. It should be noted hers 
their order is different from that pi I 
posed by Foa (1961, p. 348), anc 
shall say more about this later. 


- Accept the self's power. 
Accept the self's emotion. 
Accept the other's emotion. 
- Accept the other's power. 
Reject the self’s power. 
Reject the self’s emotion. 
eject the other’s emotion. 
eject the other’s power. 


A unit of interpersonal behavior D 
now be defined in terms of a files. 
values for these eight facet pro ince 

пе unit is a complex concept Sih 
it includes the relative positions © wer 
self and other with regard to es 
and emotion, Thus, in a single 2 


mosmHOOuW»- 


| 
| 
| 
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Accept 


Fic. 4. Circumplex of interpersonal facets. 


PE e usually asserts the relative 
Status positions of himself and another 
sm and evokes a particular emo- 
onal relationship between himself and 
another, 
vL circular order which appears 
ong measurement variables is as- 
гаа to depend on circularity of 
acets. The resolution of the order is 
accomplished by means of theoretical 
assumptions regarding facet contiguity. 
isualization of the assumed facet re- 
lationships has been attempted in Fig- 
ure 4 where concentric circles stand 
for the various facets, and sectors of 
the various circles represent facet ele- 
Ments. The letters A through H rep- 
resent the eight facet profiles listed 
above and are related diagrammatically 


to facet elements falling between the 
circle center and various points on the 
largest. circumference. The relative 
position of these letters gives visual 
representation to the circularity as- 
sumed to hold among the facet profiles. 

Given a circular set of facets and a 
set of variables purporting to represent 
the facet products, certain conditions 
are necessary before the appearance of 
a circumplex pattern among the varia- 
bles can be predicted. One condition 
is that it be possible to arrange the 
variables and the facets (or facet pro- 
files) into rows and columns of a 
matrix such that parallel diagonal lines 
will divide the matrix into sectors 
where all the facet elements have simi- 


lar values. 
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TABLE 1 
Facet Composition or LraRY's Types 


Leary's types 


Facet profiles 


> 
w 


D É 


E 


1. Managerial 
. Competitive 
- Aggressive 
. Skeptical 

. Modest 

. Docile 


. Cooperative 


омо а б wN 
+ +\o © о o\+ + 
+ + +\e о o о\ 
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+ +\o 


++ + +\°= о е ola 
OV ХБ е е 
e Sit ++ + +\е e 
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This condition can be illustrated 
using Leary's (1957) eight interper- 
sonal types as variables and the above 
acet pro- 
alues are 
the sake 
used to 
‚апда 
It can be seen 
for circumplex 


as indicating 
behavior is а 
of self and 


to both power and emotion. Skeptical 
behavior involves rejection of the self's 
power and emotion while acceptin 


the other’s power and rejecting his 
emotion. 


The facet ordering illustrated above 


differs from that Proposed by Foa 
(1961) in that Foa, apparently for 
theoretical reasons, reversed the order 
of the mode components, This rever- 


Sal does not affect the interpretation 
of the Managerial, Aggressive, Modest, 
or Cooperative behaviors, but it gives 
а peculiar slant to the other four types- 
Thus, Foa's interpretation of Com- 
petitive behavior would be that it 1m- 
volved accepting emotion and rejecting 
power in oneself, rather than accepting 
Power and rejecting emotion as pro- 
posed above, Again, Foa's scheme 
Says that Responsible behavior includes 
rejection of the other's emotion and 
acceptance of his power which pon 
not appear to be as reasonable an €x- 
planation as the other way around. 

What is claimed here is that the 
Ordering proposed in Table 1 results 
in behavioral units that have more oe 
validity than those resulting from Foa $ 
alternative ordering. Fortunately an 
empirical procedure can be suggestes 
for testing how closely a rational face 
Structure fits a set of variables ог Ei 
comparing alternative structures yn 
as Foa's and the present one. А Че 
Scription of the results of a small ed 
using this Procedure is presented а 
Primarily for the purpose of a 
how facet theory can be applied 
these questions, 
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Empirical Facet Analysis of ICL Items. 
Items from the ICL were taken as examples 
of variables which represented the eight in- 
terpersonal types listed in Table 1. These 
items were used to test the facet structure 
Proposed here and to compare it with that 
suggested by Foa. To do this, eight phrases 
were derived from the eight facet profiles 
listed above and were matched by nine 
Practicing school psychologists against each 
of the first 32 items on the ICL. Each item 
appeared on a page above the eight phrases 
which were in the form: “accepts self exert- 
ing social pressure toward others" and 
accepts self expressing affection toward 
Others." The other phrases either substi- 
tuted rejects for accepts or reversed the posi- 
tions of the words self and others. The 
instructions stated that phrases should be 
Checked only if they were implied by a sub- 
ject's positive response to the item, and no 
restriction was placed on the number of 
phrases to be checked. 

The number of checks for each phrase on 
cach item was tallied, and the items were 
grouped in terms of their application to 
Leary's eight types. Each cell in Table 2; 
then, represents the total number of times a 
facet was checked by nine persons on one of 
the four items which applied to a particular 
type. Maximum possible agreement among 
the nine raters was thus 36, and the largest 
actual frequency was 32. The interpretation 
of this particular cell frequency, to illustrate, 
is that there is high rater agreement on the 
correspondence of meaning between Coopera- 


tive inventory items and the facet phrase 
“accepts others expressing affection toward 
self.” 

An empirical estimate of the facet profiles 
underlying each of Leary’s types is given by 
the rows in Table 2. These may be com- 
pared with the rational profiles which are 
given by the corresponding rows in Table 1. 
The goodness of the fit is obscured for this 
set of data by the fact that the whole set of 
items was biased (more precisely, the inter- 
action between raters and items was biased) 
in the direction of "acceptance" or "rejec- 
tion" and probably also in the direction of 
the facet-element interactions “accepts af- 
fection” over “accepts social pressure” and 
in the direction of “rejects social pressure” 
over “rejects affection.” These conclusions 
come from a comparison of the column totals 
in Table 2 and are most easily explained by 
the fact that the first 32 items on the ICL 
were selected to represent the highest level 
of socially desirable phrasing. 

Because of these biases, the best test of 
goodness of fit is to ask whether the cells 
which correspond to the + cells (from Table 
1) have higher values than the 0 cells in 
the same column. When this is done, it is 
found that the + cells are higher than all 0 
cells in the same column in 21 out of 32 cases, 
and that the means of the four + cells are 
higher than those of the four 0 cells in all 
eight columns. 

The data presented in Table 2 also make 
it possible to compare alternative orderings 
of facets. To illustrate, Foa’s (1961) pro- 


TABLE 2 
Emprricat Facet Anatysis or 32 ICL ITEMS 


Facet profiles 


ONES D E F G H 
1. Managerial 19 10 7 9 18 
2. Competitive 14 11 10 11 19 
3. Aggressive 17 5 10 11 15 
4. Skeptical 17 10 17 20 12 
5. Modest 12 18 20 28 20 11 9 4 
6. Docile 14 26 30 25 17 5 4 
7. Cooperative 17 27 32 27 9 4 0 
8. Responsible 23 29 29 25 7 4 6 

Totals тез 180 187 102 339 — 67 69 81 
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osal is compared with the one те 
h re. Since the points of difference have о 
do only with the profiles а on 
numbered types, the comparison = Sen 
tricted to the disputed cells. The com 
кексе is made by counting the number of 
hits in the disputed cells of Table 2 for p 
roposal. Thus Foa had proposed that t e 
p of facet profiles for Leary's Competitive 
type should consist of Profiles B, RE im 
Н, while the present proposal is for Pro! d 
A, F, G, and H. If the common cells ( 
and H) are eliminated and the other values 
summed, the result is a standoff: 33 to 34, 
In the other three cases, however, (types 4, 
6, and 8), the comparison favors the present 
proposal over Foa's: 37 to 26, 43 to 19, and 
35 to 29 for the Skeptical, Docile, and Re- 
sponsible types, respectively. Thus, although 
the sample of raters was small, the facet 
structure proposed here is supported, and an 
important rational link between. theory build- 
ing and test construction was illustrated. 
The converse of this process—after deter- 
mining on theoretical grounds the facet de- 
sign which underlies a particular domain of 
investigation—would be to write items which 
meet the specifications of each combination 
of facet elements. The intercorrelations of 
variables thus defined could then be predicted 
in terms of their facet elements. 


Facets of the Attitude Domain 


Several considerations went into the 
development of a facet Structure for 
the Attitude Domain. The first had 
to do with identifying the facets which 
might underlie educational attitudes, 
This was approached by asking if the 
facets which had been 
statements about interpersonal inter- 
actions could be applied to statements 
regarding educational attitudes. А 
positive answer was found to be in 
harmony with the present position that 
(a) the educational Setting is one in 
which certain kinds of interpersonal 
interactions Occur, (b) teachers and 
counselors take value positions with 
regard to the kind of interactions which 
they believe ought to occur, (c) edu- 
cators are concerned with the issue of 
the relative degree of Status, control, 
or respect which ought to be held by 


used to analyze 


Joun L. Rinn 


teachers, students, and others, and (d) 
educators are concerned with the kind 
and degree of emotional relationships 
which are likely to contribute to the 
achievement of various educational 
goals. The school-counselor popula- 
tion in particular can be expected to 
be sensitive to the interpersonal ele- 
ments of educational practices. 

On the basis of the above assump- 
tions, then, it was held that the facets 
which underlie educational attitudes 
should be similar to those which were 
used to explicate the varieties of е 
personal behavior. It will be recallec 
that in the case of interpersonal be- 
havior, the first of these facets had 
to do with the degree of acceptance or 
rejection which the subject gave to 
Some aspect of the social situation 
(Foa's content facet). The social ob- 
ject was assumed to be a person, either 
oneself or another (the object facet), 
and the significant interpersonal char- 
acteristics of persons were their power 
relationships and their emotional rela- 
tionships (the mode facet). А 

The first facet of educational um 
tudes is provided by the degree И 
favorability which the subject feels 
towards various educational practices, 
the extent to which he accepts or xs 
jects a particular policy. The concep 
of favorability is used by сш 
(1954b) in his analysis of the principe: 
components of attitudes, and it € 
sponds to Foa’s content facet in the 
Interpersonal Domain. 1 

The second facet in the iE 
Domain has been assumed to be к 
object of the action (self ог Gili 
but in the process of identifying ый 
object of an attitude a question w 
Ts it more appropriate to consider eem 
the favorability of an educational а dic 
tude (acceptance or rejection) "beri 
rected toward a person (self or ot ia 
9r toward a mode of behavior ap on 
9r emotiona]) ? My assumption is 
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educators tend to be sensitive primarily 
to the overall interpersonal climate of 
the school and that they are secondarily 
concerned with the particular persons 
who may have contributed to the cli- 
mate. Therefore, the second attitude 
facet was identified as the mode (emo- 
tionality or control) of the attitude, 
retaining the title used in the Inter- 
personal Domain. 

Having assigned the mode compo- 
nent as the second attitude facet, the 
third facet was identified as the actor 
(self or other). This new term was 
used asa reminder that it was now the 
behavioral mode which was seen as the 
object of attention and not a person. 
The distinction might be characterized 
by saying that in the case of inter- 


Emotion 


Fie. 5. Circumpl 
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personal action, the subject directs his 
evaluation toward some person with a 
certain characteristic; in the case of 
educational practice, the subject directs 
his evaluation toward a social inter- 
action to which some person con- 


tributes. 

Having identified the facets pur- 
ported to underlie educational attitudes, 
the eight facet products or profiles can 


be listed as follows: 


Acceptance of control in self. 
Acceptance of control in other. 
Acceptance of emotion in other. 
Acceptance of emotion in self. 
Rejection of control in self. 

. Rejection of control in other. 
Rejection of emotion in other. 
Rejection of emotion in self. 


HOPPA 


Emotion 


ex of attitude facets. 
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It should be noted that the ш 
ing letter was taken from that pro file 
in the interpersonal-behavior domain 
to which this product most closely cor- 
responds. Thus, "acceptance of con- 
trol in self" appeared to relate most 
closely to “accepts the self’s power, 
and “acceptance of control in other = 
“accepts the other’s power. | Althoug! 
these products are not claimed to be 
exact equivalents, they form the basis 
upon which we can predict similar re- 
sponses on instruments used to meas- 
ure the two domains. 

An attitude unit may now be defined 
as the product of two or more of the 
eight facet profiles listed above. A 
single attitude usually contains the sub- 
ject’s relative preferences for control 
and emotionality in both himself and 
others. Since the facets are assumed 

to have a circular relationship, they can 
be represented visually as in Figure 5 


in a manner similar to that of the 
Interpersonal Domain, 


A comparison of the f. 
of the Interpersonal and Attitude do- 
mains can now be made. In the Inter- 
personal Domain, the eight profiles are 
defined as the product of three facets: 
the content of the action (accept or 
reject), the object of the action (self 
or other), and the mode of the action 
(emotion or power). In the Attitude 
Domain, the three facets are the favor- 
ability of the attitude (acceptance or 
rejection), the mode of the attitude 
(emotionality or control), and the 
actor in the situation (self or other), 


acet structures 


Factor ANALYSIS 


The three most com 
of factor-analytic metho 
simony, orthogonality, and psychologi- 
cal meaningfulness. The most contro- 
versial of these issues is the last. In 
a recent critique of issues relating to 
the psychological interpretation of fac- 
tors, Coan (1964) has described the 


mon objectives 
dology are par- 
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various positions which have been 
taken with regard to the relationship 
between factors and reality. At the 
minimum, a factor is a statistical func- 
tion of a pattern of correlations among 
psychological variables, and it may 
therefore be used as a convenient prin- 
ciple of classification. Depending on 
the variables used, it may also be ac- 
corded the status of an underlying trait 
lying outside the range of observation. 
Traits may be spoken of as theoretical 
constructs without reference to social 
or biological determination, or they 
may be assumed to represent funda- 
mental functions of the organism, per- 
haps at the neurological level. At a 
high level of generality the term type 
may be used to indicate a syndrome of 
attributes or trait indicators, . 
In the present study factor analysis 
provides a methodology for testing the 
Structure of a set of variables against 
that predicted on the basis of a rioa 
analysis of the relevant phenomena 
domain. In the following sections two 
standard psychological inventories are 
analyzed to determine how closely their 
empirical factor structures correspon 


to the dimensionality of their respective 
theoretical models. 


Interpersonal Attributes of Counselors 


H " inc 
As a basic Instrument for measuring 
the doma. 


in of interpersonal phenomena 
in a population of teachers or ашла 
ors, LaForge and Suczek’s (1955) 
Interpersonal Check List (ICL) has 
Several useful characteristics. As has 
been discussed above, the instrument 
comes closer than most to meeting the 
criteria of exhaustiveness in the Inter- 
Personal Domain оп both theoretica 
and empirical grounds. Although Bales 
(1951) has also set up his не 
system to have general applicabi = 
for face-to-face interactions, his Bp 
is usually adapted for use as a SY 
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tematic observation technique and does 
not lend itself well to self-description. 

Another important characteristic of 
the ICL is its objectivity. That is, it 
is composed of items which tend to 
describe the overt or public manifesta- 
tions of personality rather than the 
covert or private level (Leary, 1957, 
pp. 132-136). It therefore lends itself 
to use as a research tool in situations 
where it is appropriate or strategic to 
collect information at the public level 
of behavior, and in a theoretical con- 
text which is built upon the perceivable 
aspects of personal relationships rather 
than on constructs presumed to operate 
at the unconscious level of personality. 
The items also tend to be descriptive 
of the normal range of behaviors rather 
than the abnormal, as in the MMPI 
and other clinically based instruments, 
So that the inventory can be used with 
self-descriptive instructions with mini- 
mal arousal of feelings of threat. 
"These characteristics combine to make 
the ICL a rescarch instrument which is 
appropriate for use in educational set- 
tings and is compatible with a theo- 
retical framework of personality that 
emphasizes the role of cognitive aware- 
ness of self in the determination of 
human behavior. 


Collection of Data. A list of all fully 
certified Ohio secondary-school guidance 
counselors (approximately 500) was secured 
from the Ohio State Department of Guid- 
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ance.2 Each person on the list was sent a 
packet of materials consisting of a covering 
letter, a copy of the ICL marked “Self,” a 
copy of the ICL marked “Ideal Counselor,” 
a copy of the Teacher Preference Schedule 
(TPS)—to be described later—, a TPS 
answer sheet, a sheet of instructions for com- 
pleting the three inventories, and a stamped, 
addressed return envelope. Six weeks later 
a follow-up letter was sent to each person 
who had not yet returned the inventories. 

At the end of another month, over 400 
sets of materials had been returned, and of 
these, 350 were complete and usable for all 
three inventories. The responses of these 
350 counselors comprised the basic data for 
this study (Rinn, 1961b). Although detailed 
analysis of these data will be reported else- 
where, some of the findings are relevant to 
the present discussion of the structure of the 
Interpersonal Domain and are presented in 
the following sections. 

Factor Analysis of ICL Conventional 
Scores. When conventional scoring pro- 
cedures are used on the ICL, eight scores 
are derived by summing up responses in each 
area of psychological content (the interper- 
sonal traits) across four "levels of favor- 
ability" (Leary's "intensity levels"), and the 
items in each content area are weighted 
equally regardless of favorability level. The 
score intercorrelations for the present data 
are presented in Table 3. A principal-com- 
ponents factor solution of this matrix pro- 
duced the circular arrangement of scores 
shown in Figure 6, which can be seen to 
deviate only slightly from that predicted. 
That is, on the basis of the dimensional 


2 This part of the study was supported 
by a National Defense Education Act grant 
to the State of Ohio Department of Education 
while the author was an assistant professor 
of education at the Ohio State University. 


TABLE 3 
INTERCORRELATIONS OF ICL CONVENTIONAL SconEs 
1 2 3 4 5 6 7 8 
i 
2 61 
3 55 67 
4 38 51 64 " 
5 07 09 19 2 " 
6 13 11 16 32 68 
7 31 03 00 01 35 52 
8 29 —02 00 03 37 46 72 
! 
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Fic. 6. Factor plot of ICL conventional scores, 


analysis of the Interpersonal Domain shown 
in Figure 1 and the facet analysis of Leary’s 
types shown in Table 1, it was predicted 
that a circumplex structure would obtain 
among variables which sampled the Inter- 
personal Domain, This prediction is clearly 


ICL scores shown 


The analysis also 
upon which all 
mately equal loadings, 


When the cor- 
CL items were 


t the conventional 
scoring procedure had not grouped the items 


as efficiently as possible. Therefore, it was 


с 
decided to develop a new scoring procedit 
which would maximize the distance betw s 
item clusters and minimize the overlapping sis 
items in factor space. A new cluster ana У 
of the individual items was performed Ww o 
the aim of selecting the smallest number ee 
independent item clusters with Ше Bere 
internal reliability This analysis lysis 
greatly facilitated by the dimene the 
computer programs recently developed Cali- 
Berkeley campus of the University Tr on. 
fornia under the direction of Robert А set 
The outcome of this procedure was c 
Of 4 scores derived from 4 bep the 
10 items each which were identified ‘These 
names Dom, Hos, Sub, and Lov? n the 
4 clusters of items are encircled his plot 
factor plot shown in Figure 7. ia 
shows the relative positions of the 


35, 
“ICL item numbers are Dom: prr 15, 
36, 37, 38, 39, 42, 44. Hos: 8, 12, rs 26, 50, 
41, 43, 45, 46, 47: Sub: 18, 20, 22, А " 32, 57, 
51, 55, 56, 58; Lov: 27, 28, 29, 31, 
59, 60, 61, 64. 
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Fic. 7. Factor plot of ICL items 1-64. 


items on the ICL from which all 40 items 
que selected. The reliability coefficients of 
a clusters are satisfactorily high (.87, .77, 
81, .89, respectively) even though each 
cluster contains only 10 items. The cluster 
intercorrelations show a high degree of 
orthogonality and are reported in Table 4. 
The four item clusters are empirical repre- 
sentations of the four sectors of the inter- 
Person circumplex diagramed іп Figure 1. 


Educational Attitudes of Counselors 


The Teacher Preference Schedule 
(TPS) is the product of an investiga- 
tion by Stern, Masling, Denton, Hen- 
derson, and Levin (1960) into the 
nature of educational attitudes and 
gratifications. It has its theoretical 
base in the contention of Stern, Stein, 


and Bloom (1956) that unconscious 
personal motives contribute to teacher 
effectiveness and satisfaction. Thus, 
the study of motives for teaching may 
be seen as part of the broader investi- 
gation into the nature of motives which 


TABLE 4 


INTERCORRELATIONS OF New ICL 
CLUSTER SCORES 


Dom Hos Sub Lov 
Dom 
Hos 07 
Sub —09 15 
Lov 37 —27 34 
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underlie career choice and develop- 
ment. 

The use of the TPS is appropriate 
for the present study not only because 
secondary-school counselors come from 
the teaching ranks and can be pre- 
sumed to share at least some of the 
common motives of teachers, but also 
because the guidance function is in- 
extricably linked to the general objec- 
tives of education. Thus, although the 
motives of guidance counselors might 
be expected to differ in some ways 
from those of teachers (indeed, the 
nature of these differences needs to be 
investigated), it is quite appropriate 
and desirable to seek the expression 
of these motives within the general 
framework of educational goals and 
practices. 

'Two instruments were developed by 
Stern et al. (1960) for the separate 
measurement of teaching gratifications 
(Form G) and teaching attitudes 
(Form A). The latter (Form A958) 
was selected for the Present study as 
being an appropriate measure of the 
Attitude Domain to use with schoul 
counselors. In the Process of instru- 
ment development, 10 different pat- 
terns or roles were identified and are 
described In terms of their gratification 
and attitude components in Table 5. 
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any ordered relationship among the 10 
scales, the roles seemed to fit quite 
nicely into the dimensional structure 
hypothesized for the Attitude Domain, 
especially the pair labeled Dominant 
and Dependent and the ones called 
Critical and Nurturant. 

Factor Analysis of the TPS Scales. 
The items in the TPS scales were 
originally derived from statements 
made by actual teachers who had been 
identified as prototypes of the teaching 
roles. Significant item discriminations 
for all scales were reported (Stern et 
al, 1960) based on three samples of 
teachers, practice teachers, and teacher 
trainees, 

Stern et al. do not appear to have 
investigated the interscale relations in 
a systematic way, but they did report 
that simple inspection of tetrachoric 
correlation matrices revealed two clus- 
ters of scales. Cluster 1 consisted of 
the Exhibitionistic, Nurturant, Pre- 
adult, and Nondirective scales and was 
labeled child centered, Cluster 2 con- 
sisted of the Practical, Dominant. Or- 
derly, and Dependent scales and was 
labeled teacher centered, The other 
two scales (Critical, Status-striving) 
correlated with both clusters, and the 


interpretation was offered that these 
two 


Although St : attitudes might have different 
ern ; з 1 g : 
8 ct al. did not specify manifestations depending on whether 
TABLE 5 
E ANALYTIC SUMMARY ор TPS Scars 
xd тыы сал а=сэхе шаш ШЕБЕ 
oe Gratifications Attitud 
itudes 
1. Practical I 
£s nstrumental ге 
A REI GU. restige Wards Detachment "m 
li Ыла Children’s affection ани dignity 
5. Critical en = nom Encouresing self вип 
6. Preadult Vies ing teachers’ rights Reformin stuns 
7. Orderly ied мер Identification with children 
8. Dependent ешн i тез. р н il habits 
9. Exhibitionistic | САГ from superiors Coa уч = 
. Exhib istic Children’s admiratio, ooperation with auth 
10. Dominant à 


Children's obedience 


Showmanship 
Maintaining discipline 
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TABLE 6 
PS INTERCORRELATIONS OF TPS ScaLes 
t 2 3 4 6 7 8 9 10 
1 
2 —09 
3 —09 AT 
4 —12 33 53 
5 48 | —09 06 06 
6 13 28 54 38 35 
7 39 29 26 03 35 30 
8 15 48 43 24 00 34 51 
9 24 15 35 25 39 45 36 23 
10 56 05 | —01 | —26 45 21 60 28 32 


they were associated with a child- 
centered or a teacher-centered orienta- 
tion. In terms of overall mean differ- 
ences between the scales, there was 
reported a consistent tendency for the 
Cluster 1 scales to receive the higher 
means and the Cluster 2 scales the 
lower, 

. To begin the investigation of the 
interscale relationships for the present 
sample of 350 counselors, the correla- 
tion matrix was calculated and is pre- 
sented in Table 6. Inspection of this 


2 
o 
90 
oÊ 
10 
100 
2 
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matrix showed several obvious dis- 
crepancies from the two-cluster pattern 
reported by Stern et al. (1960). Since 
the matrix utilized the Pearson corre- 
lation coefficient and was based on a 
larger sample of persons than the origi- 
nal study, it was feasible to perform a 
standard factor analysis of the 10 


scales. 

A  principal-components solution 
yielded only three significant factors by 
the criteria of abrupt slope change on 
the eigenvalue curve (eigenvalues of 


o^ 


©з 


e? 


© 
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Factor plot of TPS scales. 
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3.39, 1.73, and .53). The first d 
loaded approximately equally on a 
scales, corresponding to the кча 
factor in the ICL analysis. я was 
similarly interpreted as primarily е 
to the elevation variance, that is, to 
the tendency among persons to Es a 
particular range on the answer sheet 
when responding to inventory items. 
A plot of the loadings of the 10 scales 
on the second and third factors is 
shown in Figure 8. In addition to 
moditying the general picture given by 
Stern et al., this analysis revealed that 
the TPS scales were related in a quasi- 
circumplex pattern similar to that 
found to underlie the ICL. 

Since the TPS factor. pattern was 
similar to that of the attitude circum- 
plex which had been developed on 
theoretical grounds, its dimensionality 
was examined for correspondence by 
means of a rational analysis of the TPS 
scales and items. This analysis re- 
vealed several interesting characteris- 
tics of the TPS scales which had not 
been reported by Stern et al. (1960). 

Although most TPS 
found to refer to only tw 
initiator and a receiver 
items included three c1 
teachers, students, an 
(the latter category 
boards and parent 
meant that the inter 
component of st 
to either of two 
relations and te 
lations. 
describe 


items were 
О actors (an 
), the full set of 
asses of persons: 
d administrators 
including school 
groups). This 


‚ Scales 3, 4. 7, 10 
student relations and 


8, to teacher-administra- 
tor relations. Scales 6 and 9 referred 


mainly to nonrelational, internal states 
of the teacher and posed a special 
problem that is discussed below, 
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With regard to the quality of the 
interpersonal relationships, the er 
zontal axis in Figure 8 was found to 
correspond to the emotionality ee 
sion hypothesized for the Attitude е 
main. Thus, Scales 3, 4, 2, 8 ое 
sociable or affectionate relations, an 
Scales 1, 5, 7, 10 reflected critical or 
detached relations, h 

Upon analyzing the scales along t i 
vertical axis of Figure 8, it was notice 
that at one pole Scales 3 and 4 вор" 
ported the rights of students Ln 
press themselves, to make genium 
and the like, and Scales 1 and 5 € 
for the rights of teachers in matters o 
policy making, school reform, and SeA 
sonal independence, At the other pole, 
Scales 7 and 10 denied student ш 
pendence апа initiative, and Scales 
and 8 supported teacher dependence 
upon and cooperation with the author- 
ity structure of the school. All of this 
Suggested that the vertical axis in р 
ure 8 reflected the concept of ee 
autonomy wherein relative indepen T 
ence of action was contrasted with 
conformity to rules and authorities. 
Although the concept of autonomy i- 
clearly related to the hypothetical а 
mension of control, it appears in E 
Complex form in the TPS. That 15, 1 
Appears on various scales in the даир 
of teacher independence of administra 
tors (Scales 1 and 5), student reid 
pendence of teachers (Scales 3 and i 
freedom for the teacher to be a a 
man (Scale 9), and freedom for th 
child to be a child (Scale 6). е 

П terms of the above analysis, 0) 
advocate of Stern et al.’s 1997 
"teacher-centered" syndrome hec 
favor teacher autonomy and staden 
obedience, while the Цер 
teacher would support шга, anii 
omy but give obedience to his OY 
superiors, . e 

TPS Cluster Scores. Since pe 
analyses of the 10 TPS scales indica 
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TABLE 7 


CORRELATIONS BETWEEN ICL CLUSTER SCORES AND TPS SCALES 


TPS scales 

ICL 

scores | | 
c "ra | = | ax 
т Pra | Dom | ога | Der | Sa | Nur | Non | Exh Pre 

Гов —03 | —06 02 19 15 17 оз | —02 | —02 | —04 
ps 05 08 12 01 —0; | —08 | —11 | —06 00 | —08 
es 07 08 24 25 19 12 17 | —05 15 16 

ov 06 | —02 | 00 27 25 20 28 10 14 15 


Note.—r = .14 is significant at the .01 level. 


that internal reliabilities and relative 
independence of clusters were com- 
parable to those reported in the origi- 
nal study, and since the rational analy- 
ses described above showed that the 
various scales had different facet struc- 
tures with regard to the autonomy 
dimension, it was decided to retain the 
original scoring formulas rather than 
to form quadrant scores as was done 
with the ICL. Therefore, 10 cluster 
scores were calculated for each coun- 
selor and were identified by means of 
the original TPS numbers and names 
listed in Table 5. 


DomAIN RELATIONSHIPS 


Three events now led the author to 
expect certain relationships to obtain 
between the measures of the Attitude 
and Interpersonal domains. First, the 
two domains were described in terms 
of the same set of facets, and the facet 
elements in each case were related in 
terms of a circular order. Second, the 
responses to the measures of each do- 
main were such as to produce circum- 
plex structures of variables which ap- 
peared to correspond to the theoretical 
designs of the domains. The author’s 
own need for consonance led to the 
thought that à counselor who engaged 
in, say, *dominative" interpersonal be- 
haviors would subscribe to a “direc- 
tive” educational philosophy, and that 


X^ 5 
a counselor who engaged in affection- 


ate" behaviors would prefer a “socia- 
ble” approach to teaching, and so on. 

Thus, the pairs of scores from 
different instruments having similar 
structures were examined with the ex- 
pectation that scores derived from 
similar sectors of the respective cir- 
cumplexes would covary positively and 
that scores derived from complemen- 
tary sectors would covary negatively. 
These expectancies were tested by cor- 
relating the four ICL cluster scores of 
the 350 counselors with their 10 TPS 
scale scores. These correlations are 
presented in Table 7 where the TPS 
scales are reordered in terms of the 
circular factor pattern shown in Fig- 
ure 8. 

Inspection of Table 7 showed that 
the correlations in general were not as 
high as might be expected on purely 
rational grounds, and that more of the 
existing correlation was related to the 
axes representing emotion and emo- 
tionality than to those representing 
interpersonal power and attitudinal 
autonomy. To some extent this find- 
ing may be attributed to the complex 
construction of the TPS scales along 
the latter dimension, but jt seems also 
to suggest that counselors feel more 
“of one piece” with regard to their 
affective characteristics and relation- 
ships than they do with regard to their 
influence relationships. That is, it is 
hypothesized that counselors as а 


464 


group have more conflict and anxiety 
associated with the question of how 
much control to exert over students 
and colleagues than they do with the 
question of what is appropriate emo- 
tional expression. The author s expe- 
rience with counselors in training tends 
to agree with this hypothesis, and it 
can also be pointed out that more heat 
has been generated over the directive- 
nondirective issue in counseling than 
any other. 


Counselor Types 


A final characteristic of the phe- 
nomenal model which has not yet been 
discussed is the construction of phe- 
nomenal syndromes, profiles, or types. 
Following Guilford’s (1959, рр. 99. 
106) hierarchical conception of person- 
ality structure (cf. Coan, 1964, p. 137), 
a syndrome type is identified by a pat- 
tern of positive correlations among a 
set of distinct attributes or traits, It 
is suggested here that the phenomenal 
syndrome should contain attributes 
from two or more domains and that 
this procedure will permit the identi- 
fication of congruent a 
personality profiles, 
of a congruent Profile would be the 
counselor who takes a dominant inter- 
personal role and who justifies his 
behavior in terms of a directive edu- 
cational philosophy or attitude. In con- 
trast, an incongruent counselor might 
be one who behaves dominantly but 
who claims to believe in permissive 
educational Procedures, 

, Because of the fact that the two 
Instruments selected to Tepresent the 
Interpersonal (ICL) and Attitude 
(TPS) domains Proved not to have 
completely parallel structures, it is not 
possible to give empirical support at 
the present time for the full set of 
hypothetical profiles, However, the 
construction of empirical profiles can 
be illustrated by utilizing the patterns 


nd incongruent 
Thus, an example 
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of correlation which actually exist 
among the present variables. When 
each ICL score in Table 7 was exam- 
ined in terms of its significant correla- 
tions with TPS scales, it was possible 
to construct four heuristic counselor 
syndromes. Since some attitudes (or- 
der, dependence ) correlated positively 
with all ICL types (except the "criti- 
cal” counselor), those elements were 
considered to be general effects—per- 
haps occupational characteristics—and 
were omitted from the syndromes. 
Each profile consists of a typical 
interpersonal-behavior pattern with its 
supporting attitudinal rationalizations. 
The actual phrases are taken oF 
adapted from items in the ICL and 
TPS instruments, and since these are 
self-report inventories, the profiles are 
stated in the first person. 


ә 


Managerial Counselor: I am self- 
confident and assertive, and I guide 
and direct people. They admire and 
defer to me because I maintain pro 
fessional dignity and propriety. ; 

Detached Counselor: I am skepti- 
cal and sometimes indifferent, and I 
can be critical and irritable. I get 
little gratification from my work. . 

Conforming Counselor: I admire 
and imitate others, and I want their 
advice and approval, Children cheer- 
fully obey me because I give them 
loving discipline. 

Sociable Counselor: I am always 
friendly and agreeable, and I help and 
encourage others. Children hold me 
in esteem because I give them affection 
Without losing professional dignity. 
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Braine clearly recognizes that the 
ability to deal with novel sentences 


constitutes a major part of the i 
linguistic competence. It is thus central 
among the phenomena that a theory of 
language learning must explain. Braine 
takes the ability to construct and com- 
prehend novel sentences to be a special 
case of transfer of training based upon 
stimulus and response generalization. 
Specifically, contextual generalization is 
identified as a case of perceptual learn- 
ing, “a process of auditory differentia- 
tion, or of becoming familiar with, the 
temporal positions of expressions in 
utterances [p. 326].” 


ARGUMENTS FOR CONTEXTUAL 
GENERALIZATION 


Braine notes the inadequacy of a 
theory of syntax acquisition based on 
associative relations between lexical 
items. Such a theory cannot account 
for the ability of Speakers to recognize 
the grammatical Structure of nonsense 


material Thus, a nonsense syllable, 
kivil, is recognized as a nonce verb in: 


1. People kivil every day. 


That is, 1 is recognized as syntactically 
well formed despite the lack of associative 
connections between the words. 

Such examples show that the speaker's 
ability to exploit syntactic relations does 
not depend upon forming associative 
bonds between lexical items. 


Therefore, 
Braine argues, the Speaker's information 
about syntactic structure primarily con- 


cerns the grammatical proper 
tions in sentences. 
such facts as: 
simple English 
tically the noun 


ties of loca- 
Thus, the child learns 
The first position. in a 
Sentence is characteris- 
position; the second posi- 
tion is characteristically occupied by a 
verb. Since the syntactic Properties of 
a position do not, by definition, depend 
on the lexical item or phrase that appears 
in that position, novel material in a given 
location is perceived as having the gram- 
matical properties characteristic of that 
location. Thus kivil is recognized as a 
verb in 1 because it appears in the posi- 
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tion assigned the verbal element in such 
sentences as: 


2. The boys eat the rabbits. 
3. The boys do eat the rabbits. 


For Braine, then, the syntactic proper- 
ties of a segment are determined by the 
locations in which it occurs. A verb > 
defined as a word which charactristically 
appears in the second position in simple 
sentences, a noun is a word which char- 
acteristically appears in the first n 
in simple sentences, etc. To learn ee 
Syntactic properties of a word is ре. 
marily to learn the positions in which 1 
сап occur. In particular, given the pos! 
tions a word can occupy in one genteni 
we can often predict the positions нен 
it may occupy in new sentences. Fro d 
the fact that ca! occupies the met 
position in 2, we can predict that it w 
occupy the homologous position in 4. 


4. The wolves eat the rabbits. 


Syntax assimilation. thus consists "a 
generalizing information about the os 
tions in which a word is observed 
occur. The correct use of a given ee 
in a given position in new sentences n- 
а consequence of such processes of 8? 
eralization, rd 

Braine holds that a description of ч ШП 
order accounts for much of the gramm a 
of English and, consequently, that o 
theory which accounts for the learning x- 
Positions will have considerable d 
Planatory power, Braine admits, s 
ever, that learning syntactic imag? 
cannot consist solely of learning ords 
appropriate relative positions of W out, 
and phrases. First, Braine points ou 
knowledge of relative positions W lan- 
contribute little to the mastery Я of are 
guages in which syntactic reler jen: 
expressed by inflection rather than 0 t 
Second, the notion of a position pnt a 
construed sufficiently abstractly so upie 
given sentential position can be C Notice, 
either by a word or by a phrase. — ( sition 
Íor example, that the "second Pionally 
in 3, ie, the position ис “eat” 
equivalent to the one occupied ру eat.) 
in 2, is filled by the phrase "do 
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Some explanation is required for the fact 
that phrases may exhibit positional 
Privileges analogous to those exhibited 
by single words. In short, an explana- 
tion is needed of how phrases can act 
as syntactic units. 

Р To accommodate inflection as а syntac- 
tic device and to account for the integrity 
of the phrase, Braine resorts to a limited 
associationism, Не postulates associa- 
tive bonds between "closed-class" mor- 
phemes (eg. inflections) and “open- 
class" morphemes such as nouns and 
verbs. For example, Braine would 
presumably hold that the phoneme "s" 
at the end of a noun is associated with 
the lack of a phoneme "s" at the end of a 
following verb and conversely. Such as- 
Sociations hold for simple declarative 
Sentences like: 


5. The boy eats the rabbits. 
6. The boys eat the rabbits. 


Braine believes that the formation of 
associations, augmented by position 
learning, is adequate to explain how the 
Syntax of simple declarative sentences is 
learned, In effect, Braine considers 
such sentences to be sequences of "pri- 
mary phrases.” Primary phrases are 
themselves sequences of open- and closed- 
class morphemes connected by associative 
bonds. “The location learned is that of 
a unit within the next-larger containing 
unit of a hierarchy of units. There are 
hierarchies at two levels: within sen- 
tences the units are primary phrases and 
Sequences of primary phrases; within 
primary phrases the ultimate units are 
morphemes [p. 348]." 

Braine is aware that the information 
that certain sequences of linguistic ele- 
ments behave as units and that such units 
can appear only in specified positions in 
Simple sentences does not exhaust the 
Speaker's knowledge of syntax. There 
are many different kinds of sentences 
allowing nearly all possible orders of 


Words and primary phrases. Thus, if we 
take into account all the types of sen- 
tences in which it may Occur, there are 
indefinitely many permissible ne 
of a linguistic unit (see examples, se 
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tences 35-41). A list of the positions 
available to a linguistic unit could at best 
specify its behavior in only a circum- 
scribed part of the language. Yet it is 
only the learning of such a list that 
contextual generalization could explain. 

Braine meets this objection by restrict- 
ing the scope of his theory to the as- 
similation of the grammatical properties 
of simple declaratives. He maintains 
this restriction is not arbitrary since 
simple declaratives have psychological 
and linguistic characteristics which 
justify postulating special processes for 
their assimilation. Braine thinks simple 
declaratives may predominate in the 
child's verbal environment, thus forming 
the primary models from which the 
child’s knowledge of his language is 
extrapolated. Second, Braine claims re- 
cent work in linguistics divides grammar 
into two parts. 


According to Harris (1957) and Chomsky 
(1957), the grammar of a language can be 
hierarchized into an elementary part, called 
the “kernel” of the language, and a second 
part which consists of a set of transforma- 
tional rules for deriving complex sentences 
from simple ones. The kernel grammar 
contains the definitions of the main parts 
of speech and describes rules for construct- 
ing simple declarative statements . 


[p. 340]. 


Thus, if we can explain the acquisition 
of simple declarative sentences, we have 
accounted for the basic component of the 
The remaining portion—the 
complex sentences produced by trans- 
formation—is to be described as a set of 
sublangwages, one sublanguage for each 
type of sentence (passive, relative, ques- 
tion, etc.). Rather than attempting to 
study English in its full complexity, 
Braine concludes ". . . it seems that it 
would be sound strategy to aim first at 
finding an explanation for the learning of 
the kernel of the language, ie, for the 
learning of the structure of the simple 
declarative English sentence. This con- 
stitutes enough of a problem already 
Гр. 3421.” 

Finally, 
learning, of w 


grammar. 


Braine argues that perceptual 
hich contextual generaliza- 
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tion is a special case, is a primitive 
process which does not demand much in 
the way of intellectual capacity of the 
learner. Contextual generalization would 
therefore satisfy at least one requirement 
on any process involved in first-language 
learning, namely, that it “not require 
intellectual capacities obviously beyond 
the reach of the 2-year-old [p. 326].” 

If the learning of syntax is the gen- 
eralization of the ordinal positions in 
which linguistic units appear, it is evi- 
dent that the initial stage must consist in 
the perceptual isolation of such units. 
Braine claims that an argument for the 
feasibility of contextual generalization is 
that the boundaries of such units can be 
identified with certain specifiable proper- 
ties of the speech signal. 

Braine proposes two sorts of cues the 
child could use to identify these bound- 
aries. One is "intonation": The Stress, 
rhythm, and pitch patterns of sentences 
are assumed to be acoustic features 
which communicate information about 
segmentation. The other is the position 
of closed-class morphemes Which, Braine 
holds, tends to delimit phrases, 

We now turn to a discu: 
arguments. W 


issues raised by 
inflection, intona- 
isolation of units. 
his experimenta] 


THE Rote or DECLARATIVE 
SENTENCES 


Because he believes that a theory of 
the simple declarative sentences explains 
the basic part of the grammar and that 
such sentences predominate in the child’s 
linguistic environment, Braine holds an 
account of the learning of simple declara- 


tives is important even if it does nothing 
else. We shall return to the question 
of the kernel grammar presently. Let us 
first consider the claim that the verbal 
environment of the child exhibits a pre- 


ponderance of grammatical simple de- 
claratives. 


It is clear that normal speech among 
adults does not exhibit any statistical 
bias towards fully grammatical simple 
declarative sentences. On the contrary, 
adult speech is usually ungrammatical 
(cf. Maclay & Osgood, 1959), and there 
is little evidence that adults engage in 
а careful limitation of their linguistic 
Output when conversing with children.? 
Moreover, the verbal environment of 
children includes utterances produced by 
adults conversing among themselves. 
utterances produced by siblings with little 
command of the language, utterances 
heard on radio and television, etc. These 
diverse sources presumably form а 
heterogeneous verbal environment. 

Even if simplified speech predominates 
in the child's verbal environment, there 
is no reason to Suppose that the environ- 
ment is unusually rich in simple declara- 
tives. Analyses we have made of the 
Speech of mothers taped during conversa- 
tions with their children fail to support 
that hypothesis On the contrary, of а 


? Brown and Bellugi (1964) do find а 
relatively large Proportion of fully gram- 
matical utterances in their recordings © 
mothers’ speech to children. They do nol, 
however, find that simple declaratives аге 
Preponderant among such utterances. ОЛ 
the contrary, in the only sample of their data 
they Present (a sample which they say 15 
rather Tepresentative"), only one of ү, 
SIX sentences produced by the mother is 9 
the simple declarative type. 

‘The data were supplied by Margaret 
Bullowa and her staff at the Massachusetts 
Mental Health Center. The total represents 
38 half hours of recorded conversation be- 
tween three mothers and their children 
taped at ages ranging from 6 to 30 months- 
Six transcripts were selected for analysis, 
greater weight being given the recording? 
made at 20 months than those made at 
months, i 

Neither our judgments of grammaticality 
nor sententia] type were checked for inter 


THEORETICAL NOTES 


total of 432 utterances, 258 were fully 
grammatical. Of these, only 46 were 
simple declaratives. 

OF course the character of the verbal 
environment plays a major role in lan- 
Suage acquisition. It determines which 
language, vocabulary, style, and accent 
the child learns. What is unknown, 
however, is which features of the verbal 
environment are critical for such learn- 
Ing. There is, at present, no reason to 
believe that the learning of English is 
facilitated by a preponderance of simple 
declaratives in the child's sample of his 
language. Nor is there any reason to 
Suppose that such a bias normally ob- 
tains, 

We turn now to the question of 
Whether the simple declarative has any 
formal or linguistic peculiarities to which 
its claim for special psycholinguistic 
Status might be referred. 

Braine makes a mistake that has un- 
fortunately been common in psychological 
investigations concerned with generative 
Srammar.? He supposes there exists a 
base or kernel grammar producing all 
and only simple declaratives and that 
the transformational operations produc- 
ing complex sentences are defined over 
the declaratives generated by this base 
component. If this were the case, one 


judge reliability. While some degree of 
latitude may be involved in judgments of 
the former kind, the criteria for the latter 
are reasonably objective. It is clear that 
there is need for an extensive survey of the 
verbal environment of the child; the data 
We have cited are intended only as pre- 
liminary, 

For examples of discussions in which 
this mistake appears to have been made, 
See Miller (1962); Miller, Galanter, and 
Pribram (1960); Osgood (1963); and 
Mehler (1963). All these assume that 
linguistics assigns а privileged status to 
the simple declarative. For example, Miller s 
(1962) discovery that it takes less time to 
nd the passive corresponding to an active 
than to find the passive corresponding to a 


uestion i xplained by appeal to the 
dre dod Apt tive is the under- 


linguistic fact that the ac е 
lying form in the production of the ра Ше 
and the question. There js no such linguistic 
fact. 
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might plausibly maintain that the status 
of the simple declarative as the under- 
lying linguistic form justifies a parallel 
psycholinguistic precedence. 

Braine is correct in asserting that 
there is a base form from which all 
syntactically related sentences are di- 
rectly or indirectly derived. It is also 
true that base form is produced by a 
set of rules whose formal properties dis- 
tinguish them from other rules in a 
generative grammar. But it is not true that 
the base form is the simple declarative 
sentence. The kernel grammar does not 
produce simple declarative sentences; it 
does not produce апу sentences. Rather, 
the kernel grammar produces abstract 
structures that are transformed into a 
variety of different sentence types of 
which the simple declarative is one. In 
particular, the kernel sentence discussed 
by Chomsky (1957) should not be con- 
fused with these abstract structures. 
Kernel sentences differ from sentences 
of other types solely in that they are the 
consequence of applying only obligatory 
transformations to the kernel structure. 
Kernel sentences are thus in no sense the 
source for, or underlying form of, sen- 
tences of other syntactic types.® 

Since the misunderstanding of the 
kernel notion has been widespread, it is 
worth indicating some of the linguistic 
considerations that militate against sup- 
ng the simple declarative to be the 
underlying form from which other sorts 
of sentences are derived. Consider the 
passive construction. We might attempt 
to derive 7b from 7a by a rule like that 


given in 8. 


posi 


7a. The boy chases the dog. 
7b. The dog is chased by the boy. 


‚ 
6That this has always been Chomsky’s 
view is clear from a reading of Syntactic 
Structures. That Harris does not hold the 
simple declarative to be the base form a 
lows from the fact that the notion 0 a 
derivation plays no role in Harris theory. 
The mappings Harris. (1957) employs in 
transformational analysis are characteristic- 
ally symmetrical, hence there can be по 
questions of identifying an underlying 


syntactic form. 
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8. If NP, Verb NP, is a declarative 
sentence, and if NP, is the object of the 
verb, then NPs is Verbed by NP, is the 
corresponding passive.” 


But now, consider 10, the result of apply- 
ing 8 to 9: 


9. The boy chases the dogs. 
10. *The dogs is chased by the boy. 


a string which is not grammatical for 
many dialects since the number of the 
verb should be determined by the subject. 
To avoid 10, Rule 8 must be split into 
two rules: 


8а. NP; У NPo+sg—NP, is V 
+ ed by NP. 

8b. NP; V МР, +р —> МР, +5 are 
V + ed by NP}. 


However, consider 12 and 14, the result 
of the application of 8a to 11 and 13 
respectively.? 


11. The boy is chasing the dog. 


12. *The dog is is chasing ed by the 
boy. 


We shall adhere to the notational 
conventions employed by linguists. In par- 
ticular, ungrammatical strings will be 


preceded by *, @ 

number of а linguisti 
the „English word “ “sheep 
+g”). will 
also be employed: NP for Noun Phrase, T 


stands for the zero 


tense morpheme, 
Det for determiner. 


tions and rules, ei 
which we show 
or for those that are essentially correct, 


text 
In that con- 
text they might appear somewhat differently 
—but the distinctions and characteristics with 
which we are concerned will remain un- 
changed. 
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13. The boy chased the dog. 
14. *The dog is chased ed by the boy. 


Just as the difference in the number of 
the object required us to adopt different 
passive rules for 7a and 9, so two more 
rules, 8c and d, will be required to pas- 
sivize 11 and 15: 


15. The boy is chasing the dogs. 


8с. NP, is V ing МР, + sg — МР» 
+ sg is being V ed by NP.. 

8d. NP, is V ing NP, -+ pl—NP2 
+ pl are being V ed by NP.. 


and similarly for 13 and 16. 
16. The boy chased the dogs. 


8e. МР, V+ed NP, +sg — NP2 
+sg was V + ed by NP}. 

8l. NP, V ed NP, + pl 5 NP; + pl 
were V + ed by NP,, 


In general, if we derive passives from 
their corresponding actives, a different 
Passive rule is required for each choice 
of object number and verb tense. For 
five tenses and two numbers there are 
at least 10 rules required to derive the 
simple passive from declarative sentences- 
Furthermore, even these 10 rules will not 
Serve to derive the passive of more com- 
plicated sentences, For example, we wil 
need special rules for: 


16a. Does the boy chase the dog? 
16b. Is the boy being chased by the 
dog? 


17a. Why does the boy chase the dog? 


17b. Why is the dog chased by the 
boy? 


and so on, 


In each of the cases we have discussed 
the problem clearly arises from the at- 
tempt to derive the passive from its cot 
responding declarative. This difficulty 
would be avoided were it possible to 
define the transformation which Te- 
arranges the subject and object phrase 
$0 that it applies prior to the attachme? 
of tense and number to the verb. 1° 
this way we specify the operations ОЛ 
the noun and verb relevant to ges, 
tion without reference to the particula 
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S 
NP. 
det jam aux 
N no no oux а 
the boy SG SG present 


Fic. 1. Tree diagram of the underlying (untransformed 
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Mp JM 


| ~no 
dé м 
aux b V | s. 
| the dog pl pl 
be+ing chase 


) structure of the sentence : 


“The boy is chasing the dog.” 


choi 

МЫТ о} tense and number they ex- 
: И їп a given sentence. This rule is 
asily formulated : 


sb? NP, aux V NP, is rewritten as 
2 aux be + Past part. V by NP). 


sees, it can be applied to generate 

л. only if suitable abstract 

iid fia ue of sentences are provided 

ma omain. For example, the passive 
ence, 19: 


1 Р 
Б The dogs are being chased by the 


has, as its corresponding active, Sen- 
pe 15. Yet, if 19 is to be generated 
by an application of Rule 18, 15 cannot be 
its source, since 15 does not possess the 
formal properties required of strings in 
the domain of 18. No definite sense can 
therefore be given to the notion of apply- 
ing 18 to 15. Rather, the source for 19 
is the abstract illustrated in Figure 1.9 


9? Of course, the result of 18’s application 
to the structure in Figure 1 is "of ап actual 
Sentence. The full derivation of Sentence 19 
from the structure represented in Figure 1 is 
accomplished in at least three steps: (a) 
Rule 18 converts it into the string (constitu- 
ent structure is not marked here) the dog pl 
+Past part. chase by the 
-attachment rule (af- 


fix, V—V -+ affix) ther 

the string the dog pl bet lt pres be+ing 
chase+Past part. bY the boy sg sg; (с) 
morphophonemic “spelling” rules convert 
this into Sentence 19 (see the paragraph fol- 
lowing 33 in this paper and Chomsky, 1957, 
for other examples of afix movement and 


Sentence derivations). The concept of the 


It can be demonstrated that derivations 
employing Rule 18 can account for the 
examples we have investigated so far. 
This means, in effect, that we have 
eliminated the proliferation of passive 
rules by insuring that inflections of num- 
ber and tense are not specified by the 
To do this, however, we 
have had to assume that the underlying 
form from which the passive derives is 
not a corresponding active but rather an 
abstract structure never realized іп 
speech. Since analogous considerations 
show that the active itself is merely one 


of the deformations of this underlying 
to be 


passive rule. 


structure, there would appear 
nothing in the linguistic theory of the 
sentences which 


derivation of English 
would justify assigning a speci 
to active declarative sentences. 


al status 


e level in sentences isa 
f many intuitions : for 
between “logical” and 


abstract and concret 
formal representation 0! 


example, the difference 
“apparent grammatical relations, the de- 
relations among sentences, and 


so on (see Fodor & Katz, 1964, for various 
discussions). Notice that by having the 
affix-attachment rule follow the passivization 
rule, 18, we avoid formally the multiplica- 
tion of passive rules. Thus, this analysis is 
a formal account of the intuition that “pas- 
sive” sentence js a unitary notion, and the 
intuition that it is the number of the ap- 
bject of the verb which determines 
ber in English, not the number of 
“logical” subject (see the 
te 10 in this 


parent su 
verb num 


the underlying or 
section on inflection and Footno 


paper). 
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CoNTEXTUAL GENERALIZATION AND 
THE LEARNING OF PosITION 


ose, however, that we grant 
aoe assumption that there is some 
point to a special theory of the psycho- 
logical processes underlying the learning 
of the syntax of declaratives. We must 
ask how much of the character of Such 
sentences emerges from an analysis of 
the order relations among their constit- 
uents. We shall see that very little of 
the speaker's knowledge of the syntactic 
structure of simple declaratives can be 
attributed to having learned such rela- 
tions. 

To see how little of the syntax of a 
simple declarative is expressed by order, 
it is only necessary to consider sentences 
whose grammatical structure is not so 
expressed. Thus a speaker who knows 
that 20 and 21 are Sentences also knows 
that while 22 is perfectly good English, 


there is something wrong with the Syntax 
of 23. 


20. The kangaroo cost 10 dollars. 
21. The child blew the kazoo. 
22. The kazoo was blown by the child. 


23. *Ten dollars Was cost by the 
kangaroo, 


This example 
rassing for Brain 
hibit Precisely the 
tions on the basi 
generalization 


is particularly embar- 
е, since 20 and 23 ex- 
Sort of positional rela- 
5 of which contextual 
is supposed to 


operate. 
"That is, since both “10 dollars" and "the 
kazoo" appear postverbally, and since 
"The Ка; 


€ preverbal ap- 
The 


nce between 
} Though this dif- 
ference is not revealed by their positional 


privileges in simple declaratives, it 
decisive for determining 
tion operates in sentences 
occur. 


A similar case is the following : 


24. John phoned Jane up. 
25. John phoned up Jane. 


is 
how passiviza- 
in which they 
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26. John phoned her up. 
27. *John phoned up her. 


Since “up” appears in the fourth and 
third positions respectively in 24 and 25, 
and since it appears in the fourth posi- 
tion in 26, Braine's theory predicts that 
place generalization requires its occur- 
rence in 27. This prediction is incorrect. 
On the contrary, it is a necessary condi- 
tion upon the grammaticality of sentences 
of the form: 


28. John phoned up X. 


that X not be a pronoun. Hence, if ws 
are able to formulate the rule velie 
permits 25 but precludes 27, we med 
take account not only of the ordina 
Dositions of the words in those HE 
but also of the syntactic classes to which 
the words belong. f 
These examples are a consequence 0 
a quite general fact: the types of г 
sion that can appear at a given ordina: 
position in a simple declarative zi uua 
are extremely heterogeneous. This fac 
is of the utmost importance for an еш. 
tion of Braine’s theory. For it ental’ 
that contextual generalization is an A 
adequate mechanism for extrapolating d. 
Srammatical regularities the child O 
Serves in his language. Many types e 
expression can appear at a given s 
tential position. Hence sentences dns 
hibiting precisely the same ordinal ге = 
tions among their segments may n 
theless be of different syntactic types: 
Since the operation of contextual gen 
eralization requires only that m 
Order be common to conditioned ME at 
and generalized stimulus, it follows = 
we cannot infer that String S is a per 
tence whenever there exists a иар 
S', related to S in a way that satisfies t 
conditions for contextual generalizaticl 
To put it slightly differently, ed 
examples just discussed demonstrate “he 
Some of our information about send 
Syntactic character of simple declaran a 
cannot be expressed in terms of loca the 
information ; rules which determine Беа 
interrelations among simple declarat 
and the relations between simple ces 
Claratives and other sorts of senten 
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distinguish between expressions that can 
appear in a given location. 

The similarity between Braine’s view 
and that of Jenkins and Palermo (1964) 
is most evident at this point. Jenkins 
and Palermo apparently hold that the 
syntactic structure of a sentence can be 
expressed by a sequence of markers 
representing the classes of which the 
words comprising the sentence are mem- 
bers. Thus, they remark that ©... the 
ideas we want to present here are of two 
ет sequence and class [p. 164]" and 

at 


"Colorless green ideas sleep furiously" is 
a sentence in English not because it is true, 
Or sensible, or interpretable by the listener, 
but because it is a "correct" assembly of 
classes appropriately modulated (i.e. they 
are the right general classes of entries 
Properly modified to take their places in 
the particular sequence). . . . The critical 
question, then, is seen to be that of the or- 
j^! of the elements into classes [p. 


It is thus evident that Jenkins and 
Palermo's view of syntax is simply a 
weaker version of Braine's. For while 
Jenkins and Palermo acknowledge only 
Sequences of class markers (and inflec- 
tion), Braine has noticed the necessity 
of providing some psychological mechan- 
ism to account for the phrase structure 
of such sequences. 

In short, there is no basis for Jenkins 
and Palermo's belief that the employ- 
ment of mediational paradigms affords 
a breakthrough in the study of syntax 
assimilation. Though the formation of 
word classes can perhaps be accounted 
for by such paradigms, they throw no 
light whatever upon the assimilation of 
even such relatively superficial syntactic 
features as phrase structures. It goes 
without saying that appeals to them 
are utterly unilluminating on the question 
of how the child learns the deep syntactic 
structures with which this paper is 


primarily concerned. 


Tur DERIVATION OF ORDER 


noted one sense in 


Thus far we have 1 
- atively unimportant 


which order is a rel 
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feature of syntactic structure even in 
English where inflection is not widely 
used: Identity of order relations is com- 
patible with considerable differences in 
syntactic form. There is a more im- 
portant point which also stems from 
Braine's failure to distinguish between 
underlying and surface structure. Often 
a correct formulation of the rules deter- 
mining syntactic structure requires dis- 
tinguishing between the order of lexical 
items in the sentence and in the underly- 
ing representation from which the sen- 
tence is derived. While contextual gen- 
eralization might conceivably account for 
learning the former, it is patent that it 
could not account for the learning of the 
latter. ' 

For example, in our discussion of the 
passive we treated be + ing as а unit in 
the underlying representation of sen- 
tences like 15. There are intuitive and 
formal advantages to this treatment. It 
permits us to account for the intuition 
that be +ing functions as a semantic 
unit indicating a particular mode of the 
main verb. Moreover, treating these 
morphemes as a single item permits us 
to account for the fact that such strings 


as: 


29a. *The man is chase the dog. 
29b. *The man chasing the dog. 


are not sentences: If be ing is repre- 
sented as a lexical unit, one of the mor- 
phemes comprising that unit cannot be 
selected without also selecting the other. 

Cases in which simplicity of representa- 
tion and linguistic intuition require an 
underlying order differing from the order 
of elements in the manifest sentence are 
found throughout English. Thus in sen- 
tences 24-27, the underlying form of the 
verb must be phone + up, since English 
restricts the particles that can accompany 
certain verbs. Thus we have no 


30a. *John phoned down the girl. 
30b. *John phoned the girl down. 


Similarly, 31а, but not 31b, is gram- 


matical. 
31a. They looked the house over. 
31b. *They looked the house in. 
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To account for such examples and for 
the fact that the verb-particle sequence 
forms a semantic unit, such sequences 
are recorded in the underlying form as 
Serb + particle. But notice that, though 
the particle precedes the peor in 
the underlying representation of 27, a 
mandatory transformation of that under- 
lying structure permutes the particle and 
the object whenever the latter is a pro- 
noun. Hence the underlying form of 27 
has a different order from its manifest 
form. Moreover, this underlying order 
is never directly reflected in an actual 
sentence. . 

Finally, even the very simplest sen- 
tences derive from an underlying form in 
which the order of the elements differs 
from the manifest order. Consider: 


32. The child runs. 


The terminal portion of the tree rep- 
resenting the underlying structure of this 
sentence is: 


33. Det N sg pres V, 
(Do/Caild/run. 


The tense marking (present) 
the verb phrase in the under] 
ture in order to permit a uni 
ment with more complicated 


precedes 
ying struc- 
form treat- 


sentences. 
In the manifest Sentence, the tense 
marker is always attached to the first 


verbal element in the phrase 
Whether that element is an auxiliary, 
modal, or compound verb. This is ac- 
complished for complicated expansions 
of the auxiliary by a rule Which per- 


mutes the tense marker with the ad. 


Jacent verbal element to the right, Tf 
the tense and verb order for 32 were not 
aS represented in 33, 32 


verb 


would con- 

stitute an exception to this rule and 

would thus require a complication of the 

grammar (cf. Chomsky, 1957, and Foot- 
note 9). 

This indirect relation between the 


order of elements in the underlying form 
and their order in the manifest sentence 
poses serious problems for any theory 
of syntax learning. It is clear that the 
child never encounters manifest models 
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of the underlying sentence order. Adults 


do not utter such sequences as: 
34. Фә Caild s run. 


We have seen, however, that in a 
formalization of English grammar 4 
represents a step in the derivation of 32. 
Hence, if the child who learns English 
learns the rules governing the syntax, 16 
follows that he must learn to manipulate 
underlying structures, for which his 
verbal. environment provides him with 
no explicit models. It is evident that 
contextual generalization, simply be- 
cause it is a variety of generalization, 
cannot account for such learning. 


“SUBLANGUAGES” 


We have seen that relatively little of 
the syntactic character of simple declara- 
tive sentences can be expressed in terms 
of the manifest order of their constit- 
uents. We now ask how much of the 
syntactic relations between simple de- 
clarative and other types of sentences 
can be expressed in those terms. We 
maintain that the behavior of a linguistic 
unit in complex sentences cannot be 
captured by a theory which represents 
its syntactic properties by a list of ie 
positions it is capable of occupying: 
There are two reasons for this, one 0! 
which Braine acknowledges; the othet 
has escaped him, 

It is clearly possible to construct com- 
plicated sentences exhibiting almost any 


B . in 
order of syntactic constituents. Thus | 
Sentence 15, Taine would presumably 
distinguish three 


major positions oc 
cupied, respectively, by the boy, is chas- 
ing, and the dogs. The relations be 
tween these three positions are however: 
very flexible, Consider: 


35 
boy. 


36. It is the dogs the boy is chasing- 

37. What is being chased by the boy 
are the dogs, 

38. What 
dogs. 

39. It is the dogs that are being chased 
by the boy, 


* The dogs are being chased by the 


; he 
the boy is chasing are t 
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40. Chasing them is what the boy is 
doing to the dogs. 
_ 4l. Chasing the dogs is what the boy 
is doing. i 


Far from it being the case, as Braine sug- 
gests, that transformation tends to produce 
relatively minor variations in order, ex- 
amples 35—41 show that order is the 
Syntactic property transformation is 
least likely to preserve. 

, Braine is aware that relative order 
is preserved only within sentences of the 
same syntactic type. This leads him to 
suggest that natural languages should be 
a aos of as systems of "sublanguages." 
eti ie аы by a characteristic constit- 
i order. This suggestion is defective 
"s a number of ways. First, if sub- 
anguages are specified solely by the 
Acre. relations of their constituents, 
there must be an infinity of sublanguages 
s sentences in a natural language may 
be arbitrarily long, and every pair of 
Sentences which differ in length will 
ipso facto differ in constituent order at 
Some level It is thus logically im- 
Possible that the child should master his 
language by learning one sublanguage 
after another. 

The preceding demonstrates what was 
shown above on purely syntactic grounds: 
The differences between types of sentences 
cannot, in general, be specified in terms 
Of differences in constituent orders 
Since sublanguages cannot be defined in 
those terms. 

This can be seen most strikingly in the 
ambiguity of such sentences as: 


42. The office of the president is vacant. 


where the lexical items and order of 
constituents at all levels are the same on 
both readings despite the fact that any 
reasonable analysis of English into sub- 
languages must assign 42 to two sub- 
languages, because it can be questioned 
ws. quite different ways (cf. 43 and 


resi- 


43. Where is the office of the p 
dent ? 

44, Who held the office 
dent? 


of the presi- 


There are other examples which illus- 
trate the impossibility of exploiting con- 
stituent order to capture relations be- 
tween syntactically different types of 
sentences: 45 is related to 46 rather 
than to 47 while the reverse holds for 48. 


45. John is easy to please. 
46. They please John. 

47, John pleases them. 

48. John is eager to please. 


Since, however, the constituents of 45 
and 47 are ordered in precisely the same 
way, we cannot appeal to order to explain 
the relations between the sublanguages 
these sentences belong to. 

These examples reflect a fact that must 
be evident to anyone who has seriously 
considered the problem of describing the 
syntactic structure of a language like 
English: Very few of the interrelations 
between types of sentences are expressed 
by similarities and differences of manifest 
order. Rather, a theory which marks 
such interrelations must do so in terms 
of highly abstract constructs such as the 
phrase analyses and transformational 
histories underlying sentences. Linguistic 
evidence shows that no simpler apparatus 
explains intersentential relations that are 
intuitively evident to native speakers. It 
follows that Braine’s attempt to specify 
sublanguages and their interrelations in 
terms of constituent order can be of no 
serious explanatory value. 


INFLECTION 


We now turn to some of the broader 
issues raised by Braine’s theory. The 
first arises from difficulties implicit in 
Braine’s assumption that the processes 
employed in learning order languages 
are different in kind from those em- 
ployed in learning inflected languages. 

The search for the mechanism 0 
syntax acquisition is, presumably, a 
search for species-specific capacities com- 
mon to all children. An adequate theory 
of syntax acquisition must explain the 
ability of a child to learn any language 
to which he happens to be exposed. It 
follows that a theory of language learn- 
ing which makes essential reference to 
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intonation is intimately related to syn- 
tactic structure, this indicates that the 
order of assimilation of syntactic struc- 
tures is unlikely to be the one mentioned 
above. Instead, it suggests that derived 
constituent structure may be among the 
earliest syntactic information assimilated. 
In short, the correct interpretation may 
be not that the perceived | location of 
pause, stress, and intonation are the 
child’s clue to the analysis of structure, 
but rather that the prior analysis ої 
structure is what determines where the 
child learns to hear pause, stress, and 
intonation. Nor is the possibility of 
some intermediate position excluded. 


EXPERIMENTATION ON CONTEXTUAL 
GENERALIZATION 


We have considered some inadequacies 
of explanations of syntax assimilation 
that appeal to contextual generalization. 
However, Braine has shown that sub- 
jects can learn miniature artificial lan- 
guages whose constituents are nonsense 
words by position generalization. This 
may appear to provide a foundation for 
theories of language learning in which 
contextual generalization plays ап im- 
portant role. However, 
sults are equivocal. 
of his experimental t 
fore in order, 


Braine’s re- 
A brief discussion 
echniques is there- 


There is some danger in applying 
results garnered in the study of the 
learning of artificial languages by 
latency-aged children to the learning of 
first languages by preschoolers, First, 
it is possible that the psychological proc- 
esses mediating the learning of first lan- 
guages are pretuned for the assimilation 
of systems having quite Specific formal 
properties. If this is correct, very little 
will be revealed about these processes 
by studying the assimilation of artificial 
languages whose structure is arbitrarily 
different from that of natural languages. 

Second, as Thorpe (1961) has sug- 
gested, the learning of language may be 
one of the capacities for which 


(around 1-1} years) as the period when 
the child "speaks without words." 
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there . . . exist specific brain mechanisms 
ready to be activated during and only during 
a particular period of the life span of the 
child and . . . if they are not properly acti- 
vated at the right time subsequent activation 
is difficult or impossible, resulting in perma- 
nent disabilities in later life [p. 200; cf. also 
Penfield & Roberts, 1959]. 


This speculation supports the widely held 
view that adults learn second languages 
in a manner essentially different from the 
way children learn first languages. | 
Braine has attempted to control jor 
this source of error by replicating the 
simpler of his experiments with a popu- 
lation of 4-year-olds. The point is net 
however, that very young children may 
be incapable of contextual generalization. 
It is rather that there may be language- 
learning processes operative in younge! 
children which do not occur in older ones. 
Braine controls for the former but not 
the latter of these possibilities. | í 
We now consider the details o 
Braine's experiments. It is essential to 
Braine's argument to demonstrate the 
existence of contextual generālizni m 
among phrases as well as words. V 
have seen that Braine defines the notion 
of a position in such a way that it йу 
be occupied by a word in опе senten 
and by a phrase in another. дей 
Braine’s experiments are thus divi à 
into two groups. The first demonstra : 
contextual generalization in a languag" 
Whose sentences are all of the type show 


А Р 
р 
а 
ed 
Fic. 2. Tree diagram of the allege! 


А jne’s 
n Braine 
Structure of nonsense sentences 1 


a+ p language. 
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/\ 
А 


Я Fic. 3. Tree diagram of the alleged struc- 
ure of the nonsense sentences in Braine's 
а + xp language. 


in Figure 2, where A and P are classes of 
single (nonsense) words, and a and p 
are items drawn from those classes. 

Braine's second experiment purports 
to show that contextual generalization 
can also function when the constituents 
are phrases. Here Braine uses language 
in which sentences are said to be either 
of the type shown in Figures 3 or 4. 

But Braine provides no support for 
the analyses of the String axp in Figures 
3 and 4 beyond the remark that, in the 
situations where ax was intended to have 
phrase status, a and x were presented 
together, while in the situation where xp 
was intended to have phrase status, x and 
p were presented together. 

In short, Braine's experimentation is 
at least compatible with the interpreta- 
tion that what his subjects lcarncd were 
Strings with the structure exhibited in 
Figure 5. So long as this interpreta- 


5. 


/\ | 


the alleged 


diagram of 
sentences in 


Fic. 4. Tree 
nonsense 


structure of the 
Braine's ax + p language. 
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tion remains open, Braine can hardly 
claim that his experiments demonstrate 
contextual generalization among phrase- 
length constituents. 

In experiments with artificial lan- 
guages consisting of a finite set of 
strings, it is extremely difficult to sup- 
port claims about the psychological 
reality of abstract structures like phrases. 
In natural languages, appeals to the in- 
tuitions of speakers concerning the ap- 
propriate segmentation of sentences con- 
strain phrase analyses to some extent. 
More important, formulation of the rules 
for transformation and stress imposes 
certain requirements upon the phrase- 
structure analysis since the phrase struc- 


ture articulates the domain to which 
these rules apply. In the case of the 
languages Braine studies, 


artificial 
neither of these considerations holds. 
Speakers do not have structural intuitions 


S 


a » р 

Fic. 5. А possible alternative structure 
for sentences allegedly belonging either to 
ac xp or toax + p languages. 


and no lan- 
finite set of 
cture rules in 
language can 


about artificial language. 
guage which consists of a 
strings requires phrase-stru 
its grammar, for any such 
be enumerated by a simple list. 16, there- 
fore, it is suggested that Braine's sub- 
jects may not have analyzed the stimulus 
material in the way Braine wanted them 
to, it is difficult to see what reply he 


could make. 
The difficulty 
Braine's belief that 
sible for the perceptual integrity of the 
phrase is association. On that view, 
co-presentation of pairs of nonsense syl- 
lables would perhaps tend to establish 
such pairs as phrases. The fact that the 
pairs of items intended to be perceived 


may have stemmed from 
the process геѕроп- 
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as phrases were presented together would 
thus help to jus the analysis provided 
i igures 3 and 4. 

E claim that the integrity of the 
phrase is mediated by processes of as- 
sociation is, however, totally untenable. 
Phrases in natural language can exhibit 
grammatical dependencies across strings 
of arbitrary length. It is thus inconceiv- 
able that the psychological unity of 
phrases could be attributed to associative 
bonds between the elements. : Conversely, 
the suggestion that the basis of phrase 
integrity is associationistic in short 
phrases but that some different and un- 
known psychological mechanism operates 
in longer phrases is too evidently ad hoc 
to merit considerations. 
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ON THE BASIS OF PHRASE STRUCTURE: 
A REPLY TO BEVER, FODOR, AND WEKSEL 


MARTIN D. S. BRAINE 


Walter Reed Army Institute of Research, Washington, D. C. 


On Bever et al.’s central contention, 
that the order of elements in the simple declarative 


insufficient to show 


sentence is a deformation of the order in tl 
by the choice of a particular phrase- 


deformation appears to be imposed 


structure model, rather than a fact about English. 
the writer's proposals on grammar acquisi- 


insufficient and imprecise, 


tion seem to have more potentialities 
little is known for it to be argued tha 


character of the verbal environment. 
{ word classes and several other issues raised 


respect to the learning o 
by Bever et al. are discussed. 


‘ue arguments of Bever, Fodor, and 
Veksel (1965b) will be discussed in the 
order in which they were presented. 


THE VERBAL ENVIRONMENT 


Bever et al. begin by saying that I 
(Braine, 1963) claim “the verbal environ- 
ment of the child exhibits a preponderance 
of grammatical simple declaratives [p. 
470],” and they cite data to the con- 
trary, Their data also indicate that the 
child is exposed to many ungrammatical 
utterances, although there appears to be 
some disagreement with Brown and Bel- 
lugi (1964) on this point. 

1 did not claim that children were 
mainly exposed to simple declaratives, and 
It greatly distorts my conception to think 
that a simple count of the number of 
simple declaratives in adult speech is rele- 
vant to it. Information about positional 
and contingency relations in simple sen- 
tences is abundantly exemplified in many 
transforms (as well as in many “not- 
fully-grammatical” utterances). As one 
among many examples that could be cited, 
consider the similarity in verb-phrase 
Structure between passives and sentences 
with adjective predicators (e.g. George 
quas serued-—George Das sensible): The 
same auxiliary structure (has been, would 
be, might have been being: etc.) occurs 
with past participles and with adjectives. 
Experience with the verb structure of one 
Sentence type could hardly fail to trans- 


the linguistic evidence seems 
he underlying string: The 
While undoubtedly 


than Bever et al. allow. Too 
t they are inconsistent with the 
Limitations of the proposals in 


fer to the other type. In general, it seems 


to me that too little is known about the 
verbal environment and the child's re- 
sponse to it to build much of an argument 
for or against my proposals. 
The conflicting data on the 
of ungrammatical utterances raise some 
question as to the criteria of grammati- 
cality used. It seems important to dis- 
tinguish at least three kinds of not-fully- 
grammatical utterances: (a) incomplete 
sentences where the speaker hesitates OT 
changes his mind in midsentence—these 
are likely to be grammatical between hesi- 
tations, (b) correctly constructed phrases 
(isolated noun phrases, prepositional 
phrases, etc.) which expose the major 
components of sentences to the learner un- 
cluttered by context and may be important 
in the learning of the segmentation of 
sentences, aS well as in the learning of 
the internal structure of the phrases them- 
selves, (с) radically ungrammatical ut- 
terances with the appearance of being ran- 
dom combinations of words (eg. To 
went school he for He quent to school). 
It is primarily this last category. which 
seems very infrequent, that might be ex- 
pected to handicap learning. 
G STRUCTURE 


prevalence 
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The next argument is central to the 
critique. It has two parts: Bever et al. 
claim I made a mistake in supposing (a) 


that there is “а base or kernel grammar 


MANIFEST AND UN 
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484 
i nd only simple declara- 

en e p that athe transforma- 
tonal operations producing complex Seu 
are defined over the declaratives 

oe is base component [p. 
generated by this ba: р d 
-471].” It should be carefully E Р : 
(b) is only very tangentially re eai о 
my article. My discussion of transforma- 
tions was too brief to be more than approx- 
imate, and its main purpose was to show 
that the existence of transformations did 
not contradict my proposals concerning 
simple declaratives. Yet most of the 
arguments of Bever et al. relate to (b) 
rather than (a). In particular, their long 
discussion of the passive transformation, 
associated with sentences 7a-19, is rele- 
vant only to the question of what the 
passive transformation is a transforma- 
tion of. Bever et al. argue that certain 
rules (e.g., the subject-verb concord) ap- 
ply to both active and passive, so that 
much duplication would be avoided in a 
grammar if these rules applied after the 
transformation was made. This argument 
says nothing about the relation between 
the kernel grammar and simple declara- 
tive sentences. Bever et al. implicitly ad- 
mit its irrelevance when they say “ana- 
logous considerations show that the active 
itself is merely one of the deformations of 
this underlying structure [р. 473]." 
Whether the Simple declarative is a 
"deformation" of an underlying terminal 
string thus turns on what these unstated 
"analogous considerations" are. It 
difficult to take issue with unstated 
ments. The discussion below rela 
English, for which I am fairly clear 
what the analogous considerations a 
I know of only one place in the genera- 
tive grammar literature where a direct 
argument is made that transformations 
are involved in the generation of the 
simple declarative English sentence—that 
is Chomsky’s (1957, pp. 38-42) argument 
relating to the auxiliary verb structure, 
Chomsky makes it clear that the validity 
of his argument turns on the question of 


whether discontinuous constituents are 
permissible in a phrase 


mar. He concludes: 
verb phrase we really 
elements, e.g., 


is 

argu- 
tes to 
about 
re. 


“Structure gram- 
"In the auxiliary 
have discontinuous 
e. have. en and be 
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. ing. But discontinuities cannot be 
handled within [phrase-structure] died 
mars [p. 41]." Whether or not a guum. 
structure (kernel) grammar can perm 
discontinuities is a technical question 
about the form of such grammars. Since 
Bever et al. rest so much of their case e 
the argument that the simple Lie yos 
sentence does not reflect the order of ele 
ments in the underlying string, it is pr 
sary to examine their conception of 2 
phrase-structure grammar in detail. " 

Phrase-structure grammars emp mi 
symbols representing morpheme e 
phrase classes (conventionally Lana de 
capital letters) and symbols representi s 
morphemes (conventionally written lower 
case). Rules are of the form ф > v К 
be read “р may be rewritten as ac 
where ф and y are strings of symbo d 
There is a privileged symbol, S (— деле 
tence”), and the first rewriting rule =i 
plied in generating a sentence is of th 
form S — . . .; each rewriting rule per 
mits one capital letter to be rewritten, an a 
the rewriting rules are successively = 
plied until all capital letters have been ui 
written, leaving a string of morphemes 
Which is the sentence or "terminal pup 
generated. The grammars defined in € 
manner comprise a large family of cls 
sible models. Different types of mode 
in the family are specified by esae d 
kinds of rewriting rules that are permit- 
ted (see Chomsky, 1963, for a Hen. 
tion of several types), The резина 
of discontinuous rules turns on bas y 
it is required in expansion rules А. cB 
that y be a continuous string. Dis rm 
tinuous rules drop this requirement Ra ES 
extent of permitting such goce PN 
А л... p, where the ellipsis in E p 
that A and p straddle the element © X 
right of A. Thus, a pair of rules, > s О! 
AY, A>)... y, generates ie d 
the form AYy in which the palate ch 
are Y and X... p. A grammar ed to 
forbids discontinuous rules is p ett 
generate such a string in two d 
Phrase-structure rules, Хэ АҮ m the 
— Xp, generate terminal strings ermutes 
order AY, and a transform ERAT Fa gram- 
the order ХШҮ into AYp. Suc » nguage 
mar is forced to impose on the /а 
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a distinction between manifest structure 
(AY) and “underlying structure” (АШУ). 
It is now known that certain kinds of dis- 
continuities, which include those that ap- 
pear in simple English sentences, can be 
tolerated in phrase-structure grammars 
(Matthews, 1963; Yngve, 1960). The 
recent work makes it clear that Chomsky's 
(1957, p. 41) statement that "discontinu- 
ous rules cannot be handled within 
[phrase-structure] grammars" is wrong. 
The use of models which do not admit dis- 
continuities is controversial within lin- 
guistics.? Discontinuous constituents 
have been widely used by grammarians 
outside the generative grammar school 
ур Pen. 1946; Harris, 1951; Wells, 

In their section, "The Derivation of 
Qui (which continues the argument 
begun in their earlier section, “The Role 
of Declarative Sentences”) Bever et al. 
cite as evidence for the distinction be- 
tween manifest and underlying order the 
Phrases be chasing and phone her up. 


1Yngve’s discontinuous rules are of the 
form A — л... m with the restriction 
that ‘Sentences are to be generated from left 
to right. Matthews defines a much wider 
class of discontinuities. The requirement of 
left-to-right generation could probably be 
dropped, although specification of an order 
of application of rules is necessary. (In the 
above example, if Y — WZ, then the rule 
sequence X — AY, А > ^.. -m Y 7? 
WZ generates AWZp, and the rule sequence 
X — AY, Y > WZ, А > А... p gen- 
erates \WyuZ: Failure to specify an order of 
application would leave the terminal order 
vague.) Specification of an order among 
rules is a normal practice in grammars with- 
out discontinuities (Bach, 1964). Matthews 
shows that, with the left-to-right restriction, 
grammars with discontinuous rules generate 
context-free languages ; without this restric- 
tion they generate context-sensitive lan- 


guages. 


2 Unfortunately, the argument over dis- 


continuous rules has taken place not over 
whether such rules exist and what their 
Precise form is but rather as part of the 
larger argument that transformational gram- 


mars are necessary (€« Harman, 1963; 
Postal 1964). Тһе likelihood that both 


transformational and discontinuous rules exist 
has received little discussion- 
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Both these are of the AY p form. A gram- 
mar which forbids representing the con- 
stituents in the form be . . . ing and 
phone . . . up must necessarily gener- 
ate the orders be+ing+chase and phone 
-pup--her; having generated the items in 
the wrong order, it is of course necessary 
to permute them into the actual order. 
The entire argument associated with sen- 
tences 202-34 rests on a particular choice 
of phrase-structure model. 

This distinction between manifest and 
underlying structure in kernel sentences 
creates a special kind of methodological 
difficulty in assessing the validity of gen- 
erative grammars. A full grammar con- 
tains both a phrase structure and a trans- 
formational component. If the terminal 
strings generated by the phrase structure 
are permitted to be arbitrarily different 
from any actual sentence structures, there 
are no independent data against which the 
phrase structure and the transformational 
rules can be separately tested. Since 
transformational rules provide an extra- 
ordinarily powerful tool for mapping one 
system into another, the grammarian can 
write the phrase-structure kernel partly 
on the basis of his convenience, free to 
correct any poor fit with the manifest 
structure of the language by using trans- 
forms to reshuffle the elements. This 
methodological looseness makes it im- 
possible to accept empirical claims about 
the properties of phrase-structure gram- 
mars of natural languages. (For ex- 
ample, the similarity in structure between 
ordering and inflected languages, reported 
by Bever et al. in their section, “Inflec- 
tion,” may not reflect a linguistic univer- 
sal, but rather the prior decision to treat 
a certain kind of phrase-structure descrip- 
tion as convenient.) 

In general, in order to establish the 
existence of “deformations” in simple 
declaratives, Bever et al. would have to 
show that there are no feasible forms of 
phrase-structure model which would per- 
mit transformational grammars to be 
written so that only the identity transform 
intervened between the phrase structure 
and the morphophonemic rules. No one 
has yet attempted to make such a difficult 
argument. Instead, it seems to have been 
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assumed that, if transformational rules 
are to be used at all, then there is mb 
motive for not exploiting them, leaving 
the detailed character of phrase-structure 
grammars to be settled, in part, according 
to the grammarian's convenience. It is 
doubtful, however, that the distinction 
between manifest and underlying struc- 
ture in kernel sentences is ultimately con- 
venient even to the grammarian, for the 
methodological reasons noted above. The 
distinction is certainly not. convenient 
psychologically: The assumption that the 
learner learns structures to which he was 
never exposed would indeed pose “seri- 
ous problems for amy theory of syntax 
learning [p. 476]," as Bever et al. note. 


THE SCOPE or THE PROPOSALS 


The next argument, associated with 
sentences 20-28, is that my proposals ac- 
count for very little of the structure even 
of simple declarative sentences. Although 
my article did not claim sufficiency, my 
proposals seem to account for more of 
the structure of the simple declarative 
than Bever et al. allow, 

The assertion that the theory must fail 
for sentences 24-28 reflects an over-sim- 
plistic interpretation of the theory. Posi- 
tions learned are those determined by the 
immediate constituent analysis; I never 
remotely suggested they were a simple 
1-2-3-4 ordinal series. John phoned Jane 
up contains two main parts—John and 
phoned Jane up; phoned Jane up contains 
two parts—phoned . , up and Jane, 
Presumably English speakers learn that 
verb units often contain verb-particle 
Sequences with the particle frequently con- 
tingent on the verb (phone up but not 
phone down, think over but not think 
across, etc.), and they also learn that 
noun phrases sometimes follow the yerb- 
particle sequence (phone up the girl) and 
sometimes occur between these two parts 
of the verb unit (phone the girl up). 
They may also learn that a short list him, 


her, it, us, etc. always go between the 
verb and the particle.? 


з More probably, however, they learn a 
stress correlate for the construction. Bever 
et al. make a mistake about English struc- 


Sentences 20-23 are not apposite for 
several reasons. First, Bever et al. seri- 
ously misunderstand my proposals : They 
assume I postulate that if a list of items 
occurs in a certain position, and one mem- 
ber of the list occurs in some up cape 
ogous position, then the entire list wi 
occur in both positions. This is a 
Stronger generalization mechanism р € 
I proposed. Second, the anomaly in X 
presumably lies in the verb phrase was 
cost, not in the occurrence of fen n 
as subject. Third, my article specifically 
excluded the learning of transforms, 50 
that neither 22 nor 23 are appropriate 
examples; my proposals could capture 
the structure of passives only insofar as 
they are similar to predicate phrases 
with adjective heads (cf. The kazoo was 
blown by the child; The coffee was hot on 
the stove). | 

Since it appears that Bever et al. have 
misunderstood the proposals, I must try Xo 
restate them more clearly and clarify their 
limitations. The discussion includes some 
Tecent thinking. The proposals are that 
what is learned are the temporal positions 
of units in verbal arrays and contingen- 
cies between morphemes. The poir 
learned is the position of a unit within ШЕ 
next-larger containing unit of a hierarchy 
of units. Position within a unit may i 
defined absolutely (e.g., first) or rp m 
to a reference point (e.g., before f, pre 
after f, second after f, where f is some E 
quently Occurring morpheme—learning О 
such positions is shown in Braine, 1965b). 
Contingent morphemes need not be adjar 
cent. All these positional and cantingen 
relationships assumed to be learned 22 
be represented by rewriting rules; 10 


es r 
ture when they assert that “it is a necessary 


№ Waite n- 
condition upon the grammaticality of se 
tences of the form: 


28. John phoned up X. 


that X not be a pronoun [p. 474].” 2x 
nouns can occur after the particle ned s 
are strongly stressed (THAT woman! oe 
phoned up HER!). There appears En in 
some requirement that the terminal haa p 
this construction have at least т. 
stress; pronouns are not normally as stro the 
stressed, and this may be the reason 
terminal placement is avoided. 
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example, the rules X — fAB, X — gAC, 
A> ay 4s + а a B Эу bs. „C> 
€, €, ... represent that X phrases are 
prefixed by either f or g, that the a; occur 
in second position, and that the b; and с; 
occur next-but-one after f and g. respec- 
tively. An adequate notation would pre- 
sumably be a phrase-structure system of 
some kind. (The insufficiency of a finite- 
state notation is shown in Braine, 1965a.) 
A phrase-structure grammar is essen- 
tially a system for representing hierarch- 
ically organized positional relationships, 
and my proposals posit that it is these 
positional relations that are learned. А 
i аа the theory appears to provide 

sufficient basis for the learning of the 
treelike structures of a phrase-structure 
grammar, its handling of the learning of 
Word classes requires examination. The 
theory implies some special limitations on 
the learning of unmarked word lists (i.e. 
word lists which are not correlated with 
Particular function morphemes). The 
learner should have two kinds of diffi- 
culties with unmarked word lists: regis- 
tering that (a) two such lists occurring 
at homologous locations are different lists, 
and that (b) one list occurring at two 
nonhomologous locations is the same list. 
The nature of the problem may be illus- 
trated by considering a pair of rules of the 
form X — AB, X — CD, where A, B, C, 
and D are arbitrary lists of words. E 
posed to such a system, the learner should 
readily learn which words go first and 
which second, but he has no way of dis- 
tinguishing A items from C items, or B 
items from D items; thus, he should con- 
fuse the subsystems. (There is labora- 
tory evidence for such confusion—Smith, 
1965.) For the subsystems to be distin- 
guished, either the classes have to be 
marked (eg, X — АВ, X > cD, A’ 
— fA, B > gB, C > hC, D' — kD), 
or else the subsystems have to be marked 
(eg, X = fAR, X > gCD, where the 
list positions are differentiated as “first 
after f," “second after f,” etc.). 

Tt follows from the above considerations 
that the kind of phrase-structure system 
for which my proposals are adequate 15 


one which permits only one unmarked 
word list at any given location and which 
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restricts the circumstances in which an 
unmarked list can occur at two nonhomol- 
ogous locations. A property of this sort 
may be called an “indexing” restriction, 
since such a grammar provides, in effect, 
an overt indexing system for its vocabu- 
lary lists. An indexing restriction would 
explain the purpose of function mor- 
phemes: Perhaps the human subject needs 
function morphemes to index classes and 
distinguish their locations. It is im- 
portant to observe that a phrase-structure 
model which introduces classes as arbi- 
trary lists of words (as in rules like N > 
man, ball, Vo hit, take, . . -). 
and which imposes no restrictions on how 
the lists may be distributed in the rules, 
assigns to the human subject an unlimited 
capacity to learn and retain long lists of 
unrelated items occurring in complex 
positional relationships to each other 
without becoming confused. At the most 
general level, the question involved here 
is whether characteristics of human learn- 
ing and long-term memory place any es- 
sential limitations on the form of a gram- 
mar. No limitations are built into the 
context-free and context-sensitive models 
(Chomsky, 1963) assumed by Bever et al., 
and these models do not distinguish func- 
tion morphemes and lexicon. I would 
urge that laboratory studies of the learn- 
ing of semantically empty systems can 
make an important contribution to the in- 
vestigation of subjects’ abilities to handle 
lists of unrelated words. A promising 


method for such studies is suggested else- 
cf. also Smith, 


where (Braine, 1965a; 
1963, 1965). 
Covert Categories 
Despite the plausibil 
indexing restriction, it is a fact that 
languages contain structures 10 which 
the same position can be occupied by two 
or more classes which are not differen- 
tially marked. Bever et al. are correct 


ity of some kind of 


te this restriction precisely, much 
ntal work is required: Loca- 
tions can be partially homologous, and the 
circumstances in which lists occurring at par- 
tially homologous locations tend to be learned 
as the same list, or as different lists, have 


to be determined by experiment. 


3 To sta 
more experime 
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in pointing out that this sort of ons 
poses a problem for my proposals. га 
“covert categories,” as they were calle 
by Whorf (1956), may ultimately be 
based on semantic properties or relations. 
This notion would be consistent with the 
thinking behind my theory : Class markers 
provide a device for making heterogene- 
ous items similar (they all “go with” the 
marker); however, when items are al- 
ready similar by virtue of a semantic 
property or relation of some sort, it would 
be readily understandable that the overt 
marker might not be needed—the covert 
semantic marker would suffice to index 
the list. Covert semantic markers un- 
doubtedly exist. Proper names provide 
an obvious example: A speaker hearing 
an exotic name for the first time presum- 
ably does not need to hear it used in noun 
positions in sentences in order to know 
its grammatical category. Unfortunately, 
semantic correlates are rarely so trans- 
parent. More often, although the class 
has a set of nuclear members which 
clearly have a common property, it also 
contains many words without the prop- 
erty. For example, Whorf (1956) says 
of some covert noun subclasses in Navaho: 


Some terms belon 


g to the round (or round- 
ish) class, 


others to the long-object class, 
into classes not dependent on 
shape. . . . I doubt that such distinctions 
are simply... recognitions of... objec- 
tive differences. . . . One must learn as a 
Part of learning Navaho that “sorrow” be- 
longs in the round class [p. 91]. 


Since the speaker has to learn not only 
that round versus long is the basis of clas- 
Sification but also that "sorrow" is to 
"count as" round, it follows that a seman- 
tic-correlate theory of covert categories 
gives the correlate the Status of a medi- 


ator in the sense of Jenkins and Palermo 
(1964), 


Bever et al. are mis 
that the ideas of Jenkin 
“simply a weaker versi 


475]."5 


taken in asserting 
ѕ and Palermo are 
on of Braine's [p. 
Bever et al. point out that Jen- 


5 [ncidentally, it is inappropriate of Bever 
et al. to attribute to Jenkins and Palermo a 
belief that the employment of mediational 
paradigms affords "a breakthrough" in the 
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kins and Palermo have little to say аы 
phrase structure, and the difficulties face 

by mediation theory in explaining posi- 
tion learning have been indicated iy 
Smith (1965). However, Jenkins an | 
Palermo do provide a reasonable theory 
of word-class formation that is at _ 
theoretically capable of explaining cover 

categories. Their ideas are complement- 
ary to mine and not a weaker competitor. 
It is important to note, however, that mp 
proposals imply a restriction on the scop 

of the four-stage stimulus lu aa ert 
paradigm: It should not be demonstra à e 
without some semantic support for the 
equivalence classes. 


SECONDARY Issues 
А » 
Transformations and “Sublanguages 


My article was mainly concerned with 
Phrase structure, and the brief overview 
on transformations was not intended as s 
Precise statement about English, bu 
rather as a general line of thought as km 
how a quest for what is learned migh 
approach these aspects of English struc- 
ture. English clearly does present hs 
number of parallel phrase and sentan 
forms of the kind described as sullen 
guages. What might be learned in latu 
ing transformations is difficult to iud 
cuss in part because the transformation 
Concept itself seems unclear. The un 
Clarity appears in at least three pe 
First, it seems impossible to deduce mon 
Chomsky’s (1961) definition of a trans 
formational grammar what үш 
should Бе built into a verbal learning sect 
periment in order to ensure that a subject 
is faced with the task of learning a ere 
formation ; laboratory study is therefo a 
difficult, Second, it seems likely vara 
resolution of Putnam’s (1961, p. 41) cri 5- 
cism, that Chomsky’s definition of a T m 
formational grammar is “much too wl T 
will affect the character of such Бе 
mars. Third, there are serious unreso! pm 
difficulties associated with the Deed 
Structure of the products of чив 
tion (Bach, 1964, pp. 70-78; С Об) 
1955); discussing these, Chomsky ( 


kins 
study of syntax. This term was not Jen 
and Palermo's, 


THEORETICAL NOTES 


has said candidly: “All answers I have 
seen so far are ad hoc. . . - I think... 
mere is some important insight that is 

ill lacking [p. 158].” Transformations 
are surely destined to be further clarified, 
and detailed discussion of their learning 
P have to wait upon the clarifications. 
Meanwhile, the difficulty of the laboratory 
study of transformations provides a prag- 
matic reason for concentrating on phrase 
Structure. 


Inflection 


а is no truth whatever, nor war- 
I in my article, for the statement that 
ше that the processes employed in 
kind РЕ order languages are different in 
е Dat those employed in learning in- 
ie anguages [р. 477]. | Of course а 
ory of grammar acquisition should ex- 
plain a child's ability to learn any lan- 
guage. However, I doubt that the Latin 
speaker learned what Bever et al. say he 
learned. 
т... тајог cues in Latin are based on 
ative position. The subject of the 
Sentence is the element which occurs im- 
mediately before the nominative-case suf- 
fix; the object is the element immediately 
before the accusative-case suffix; the verb 
is the element to which suffixes compris- 
ing tense, mood, voice, aspect, and person 
morphemes are attached, etc. It is the 
learning of these kinds of relative posi- 
tional relationships which has to be ex- 
plained in giving an account of the learn- 
ing of the grammar of an inflected lan- 
guage like Latin. It is not apparent why 
my viewpoint would have more difficulty 
ү Latin-like languages than with Eng- 
ish, 

The analysis of Latin provided by 
Bever et al. leads to curious problems. 
If the “underlying strings” generated in 
the phrase structure have a rigid word 
order, then it must be assumed that the 
learner learns this order, even though no 
such order is visible in the language. It 
is difficult to see how the learner could 
learn this, or why one should want to 
postulate that he does. The analysis is 
not made more plausible by assuming that 
the learner also learns permutation rules 
governing the movements of the contents 
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of various positions to other positions. 
Moreover, the suffixes, which are clearly 
the cues defining the major grammatical 
positions, enter only peripherally into the 
formulation. I know of no generative 
grammar of Latin that has yet been pub- 
lished, and it may not even be certain that 
a feasibly simple grammar can be written 
in the manner suggested by Bever et al. 
Simple Latin sentences can contain more 
than three words. Six words have 720 
possible orders, and eight have 40,320; 
even allowing for the rarity of some of 
these possible orders, a very large number 
of permutation transforms would pre- 
sumably be required merely to generate 
the kernel sentences. 


Intonation 

I did not claim “that the discrimination 
of acoustic properties of speech signals 
accounts for the learning of segmenta- 
tion [p. 479].” I suggested only that in- 
tonational cues might be helpful, a pos- 
sibility Bever et al. admit. 
Experimentation 

With regard to my Experiment III, it 
is true, as several people have pointed out 
to me, that the segmentations are not in- 
herent in the “languages” but are deter- 
mined by the experimental procedure. 
Throughout the training phases, A and 
PQ phrases (in the A+PQ system) are 
presented as stimulus units in sentence- 
completion problems of the form а... 
(aj, pu) and . . . Pidi (a, руду); it is 
assumed that this presentation 0 the 
phrases as units during training is suffi- 
cient to define the sentences as having two 
parts, an A phrase anda PQ phrase. The 
results of Experiment II provide some 
support for this assumption. : 

The reconstructive-memory technique 
(Braine, 1965a) seems superior to the 
sentence-completion procedure as a 
method for studying gr 
An important reason 
with the sentence-completion 
not that it fails t 
tion, but rather that it does specify it. 
is greatly preferable 


required to discover 
himself. Ihave little doubt that the learn- 


490 


ing of positions within phrases can be 
demonstrated to the satisfaction of Bever 
et al. = 

Bever et al. raise objections to labora- 
tory learning studies. , However, their 
thought that young children may have 
especially rich capacities for grammar 
acquisition ("pretuned") is relevant only 
if it is assumed that these capacities do 
not operate in the laboratory ; people who 
have speculated about the possibility of 
such capacities seem to believe that they 
endure until adolescence (Bellugi & 
Brown, 1964, p. 139), well past the age 
when laboratory work becomes feasible, 
I am unable to follow the second objec- 
tion, that useful laboratory work is ex- 
tremely difficult with artificial languages 
consisting of a finite set of strings. Their 
point that “any such language can be 
enumerated by a simple list [p. 481]” 
would be appropriate only if subjects 
learned a list of sentences as a list. On 
the contrary, memory for a list of pat- 
terned strings is primarily a reconstruc- 
tive process and not a learning of a large 
number of individual items (Braine, 
1965a; Smith, 1965). Subjects learn 
regularities in pattern that the strings 
exhibit, and “recall” is in large part a 
reconstruction of strings which have the 
regularities that the subject has regis- 
tered. Thus, laboratory studies yield in- 
formation about the pattern properties or 
“structural descriptions” learned. 

In conclusion, I would like to return to 
what seems to me the fundamental issue 
at stake in this interchange : whether the 
order of elements in simple sentences re- 
flects the order in the underlying string, 
It seems a very natural psychological ap- 
proach to grammar to regard grammar 
acquisition as a learning of the patterned 
regularities in Sentences, and my pro- 
posals are initial Suggestions as to the 
bases of some kinds of regularities. This 
approach promises to be fruitful. How- 
ever, Bever et al. are arguing that this 
approach is precluded because the learner 
is often never exposed to the pattern prop- 
erties which must be assumed to be 
learned. Before accepting this argument, 
I would urge that psychologists examine 

the linguistic evidence for a distinction 
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between manifest and underlying struc- 


ture in kernel sentences very closely 
indeed.5 


5 (This footnote was added in proof fol- 
lowing receipt of Bever et al.’s (1965a) re- 
joinder.) I see that Bever et al. now allow 
that it would be possible to write a grammar 
of English in which there would be no distinc- 
tion between manifest and underlying struc- 
ture in simple declaratives but claim that such 
a grammar would involve an unnecessarily 
complex system. They seem to assume that 
I would agree, since their section, "The 
Moral" accuses me of "playing fast and 
loose with simplicity constraints [p. 497]." I 
think that it is simply untrue that a phrase- 
Structure analysis permitting discontinuous 
rules must lead to the kinds of complexities 
they claim: They do not appear to have 
investigated the potentialities of such an 
analysis thoroughly enough. Specifically, 
while they have seen that the affix-permuta- 
tion rule ( Chomsky, 1957, p. 39, Rule 29-ii; 
1962, p. 141, Rule 12) would no longer be 
needed in the generation of simple declara- 
tives, they have failed to see that it would not 
be needed elsewhere in the grammar either. 
This rule is the one which introduces the dis- 
tinction between manifest and underlying 
structure into the auxiliary verb phrase, and 
the possibility of its total elimination (a gan 
in “simplicity” in the transformational part 0 
the grammar) is one of the reasons for think- 
ing that English contains discontinuous rules. 
Bever et al. say: "Of course the facts might 
have shown that the discontinuous phase- 
structure rules are the appropriate linguistic 
analysis. To show this, the facts would have 
to be (at least) that no sentence constructions 
require а transformational rule for affix move- 
ment which can also be utilized by the simple 
declarative [p. 496; italics mine]." Since the 
italicized clause is true, the facts argue that 
discontinuous rules are an appropriate analy- 
sis. (Incidentally, there are independent ob- 
jections to the affix-movement rule—see 
Bach, 1964, р. 81, The rule is not well 
formed because the symbol, “Affix,” is in- 


troduced ad hoc, and reformulation is not 
simple.) 


A word on "simplicity": It is possible to 
agree with Bever et al. that simplicity is an 
important consideration in scientific theor- 
izing and still to reject their use of sim- 
plicity arguments. I am troubled by two 
main difficulties of principle. TN 

1. Detailed measurement of simplicity (e.g. 
by means of a rule count, or other metric) 
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pense, The difficulty is well illustrated in 
Bever et al.'s main argument: They contest 
my proposals on the ground that simple Eng- 
lish sentences contain a distinction between 
manifest and underlying structure; however, 
they acknowledge that a grammar could be 
written in which there is no such distinction, 
but claim that such a grammar should be re- 
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less, they allow that the grammatical solution 
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that it would more than compensate for an 
increase in grammatical complexity involved 
in writing the grammar 50 that the under- 
cture was overt. And it has not 
that there must be an increase 
1 complexity. (Ultimately, it 
nious to identify gram- 


th ease of acquisition.) 


lying stru 
been shown 
in grammatica 
would seem parsimo 
matical simplicity wi 
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IS LINGUISTICS EMPIRICAL? 
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This paper continues the discussion of issues raised by Braine’s theory 
of “contextual generalization.” The arguments for analyzing the Eng- 
lish declarative as transformationally generated are discussed at length. 
Broader issues about the nature of confirmation of claims made by 
grammars are also considered. It is argued that while the direct ex- 
perimental verification of such claims is often not feasible, consider- 
ations of simplicity and generality can provide adequate grounds for 
their empirical confirmation or disconfirmation. 


Braine’s (1965) reply to our (1965) 
paper successfully clarifies the two major 
issues between us. The first of these con- 
cerns a detail of the grammar of English: 
Braine claims (and all generative gram- 
marians have denied) that the simple 
declarative sentence has a special lin- 
guistic character in that it has no trans- 
formations in its derivational history. 
The second issue concerns the possible 
bases for confirming or disconfirming par- 
ticular claims about the character of the 
grammatical analysis of a class of sen- 
tences. In our critique we argued that the 
special status Braine seeks for simple 
declaratives can be obtained only at the 
price of the ad hoc decision to treat them 
in isolation from the rest of the types of 
sentences in the language. This is equiv- 
alent to saying that a grammar which 
treats declaratives as nontransformational 
is inherently more complex than a gram- 
mar which does not. The methodological 
issue between us is that Braine does not 
accept such arguments as supplying even 
prima facie confirmation of the trans- 
formational characterization of declara- 
tives, Braine apparently holds that each 
decision in the construction of a grammar 
must be susceptible to direct experimen- 
tal confirmation or else be considered 
a mere artifact of the linguistic descrip- 
tion, adopted “at the convenience of the 
linguist.” It seems to us that Braine 1s 
certainly wrong on both points. On one 
hand the grammar that Braine proposes 
for simple declaratives 15 thoroughly un- 
acceptable; on the other hand to accept 
Braine's view of confirmation would be to 


493 


make linguistic science (or, for that mat- 
ter, any other kind of science) impossible. 
We shall discuss these points in reverse 
order. 

Progress in linguistic analysis of nat- 
ural language has depended on the careful 
separation of the theory of the language 
("langue" or "competence") from the 
theory of the use of langauge (“parole” 
or "performance"). In this way the lin- 
guist has insulated himself from the fact 
that the variables determining the char- 
acter of speech behavior reflect features 
other than the formal structure of the 
spoken language. For instance, the fact 
that some sentences are difficult to say, to 
remember, or to understand is obvious. 
Evidently, such facts are the consequence 
of interactions between linguistic variables ; 
and variables of memory, perception, mo- 
tor integration, etc. To fail to so repre- 
sent these facts would render impossible 
the representation of either the syste- 
matic character of language 07 the sys- 
tematic character of speech behavior. | 

To avoid this consequence, linguistic 
analysis does not accept as pertinent to the 
characterization of language all facts that 
are pertinent to the characterization of 
verbal behavior. Only certain types of 
information about language are consid- 
ered, for example, whether a sequence is 
intuitively grammatical, its relations to 
other sentences, the units of which it is 
constructed, etc. Such information is 
quite real and as palpable as any other 
psychological data; that all facts about 
speech are not considered in linguistic 
analysis does not reduce the scientific ob- 
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jectivity of those that are considered. A 
theory that can explain the latter. phe- 
nomena is in fact a sound, empirically 
supported part of the total theory of lan- 
guage behavior. The linguist and psy- 
chologist seek to discover how the theory 
of the language, which explains one re- 
stricted set of data, is embedded within 
the theory of the speaker of a language, 
which explains speech behavior in general. 
It is not at all surprising that the analy- 

sis of speech behavior should proceed from 
two empirical and theoretical sources. In- 
deed, distinguishing among the different 
kinds of data that constitute super icially 
homogeneous phenomena is absolutely 
universal in scientific explanations: it oc- 
curs wherever considerations of simplicity 
and explanatory power require that the 
observations be represented as interaction 
effects. Consider, for example, the analy- 
sis of a block sliding down an inclined 
plane. There are two kinds of variables 
that interact to determine the block's be- 
havior—first, the forces acting downward 
on the body and determining the accelera- 
tion for an ideal system ; second, the reac- 
tive forces (e.g., friction) due to the char- 
acter of the particular body and plane 
under study. The observed behavior is 
susceptible of systematic expl 
on the view that it is the pro 
actions between these distin 
Since the theory of verba 

is directly concerned wi 
Speakers, it may often be subject to fairly 
direct experimental examination. The 
theory of competence, on the other hand, 
is concerned with the formulation of the 
linguistic information underlying verba] 
behavior. The Speaker's competence is 
only reflected in his behavior via the kinds 
of performance variables mentioned above, 
The direct experimental Verification. of 
the theory of competence is correspond- 
ingly difficult. 
This is not to say that the theory of 
competence is in any sense conventional 
or arbitrary. For, its Support rests not 
only on occasional experimental confirma- 
tion but also on considerations of the- 
oretical simplicity and power, fruitfulness, 
availability for integration with theories 
of performance variables, and so on, It 


anation only 
duct of inter- 
ct systems. 

1 performance 
th the behavior of 
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must be reemphasized that this relative 
inaccessibility to experimental manipula- 
tion is not specific to theories of linguistic 
competence. On the contrary, it 15 oe 
vious that the more a scientific theory 
concerns itself with the fundamental 
mechanisms underlying the observables, 
the less susceptible it is to direct experi- 
mental test. 


Some EMPIRICALLY Baskp FORMAL 
DECISIONS 

We now consider some of the points 
where Braine feels that the form of the 
grammar is at the grammarian's (and 
his) arbitrary disposal. In each case we 
try to show that the particular claim about 
Srammar we made in our original critique 
is responsive to some compelling facts 
about natural language. In any given 
instance it is, of course, possible to argue 
that the facts we invoke are wrong or that 
they might be accounted for in some more 
economical fashion, But it is not possible 
to claim that the form of the grammar is 
arbitrary. We will take the points in the 
order they appear in Braine’s reply and 
omit discussion of Braine's new data ог 
proposals and of the accuracy of our read- 
ing of his original paper.! 


Tue Nonprivitecen Stratus ОЕ SIMPLE 
DECLARATIVE SENTENCES 


We argued that declarative sentences 
have neither linguistic nor psychological 
Preeminence and that the underlying con- 
Stituent structure or simple declarative 
Sentences is itself an abstract object, jum 
as in the case of the passive and all other 
Constructions. Braine counters with sev- 
eral arguments indicating that we did nue 
Suggest evidence for this and that the 
only evidence he can think of relies on 


1If we misunderstand Braine (1963) rs 
Some points in our critique, we apologize to 
him and to the reader and suggest that they 
both. satisfy themselves that our misunder- 
Standings were based on misreading. In fs 
paper we quote Braine directly wheneve 
Possible. We (o not utilize any linguistic 
discoveries made after 1957; recent theoreti- 
cal modifications do not affect Braine’s 
theory nor its incorrectness. 
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arbitrary decisions about the form of 
grammar : 


[1.] .. . their long discussion of the passive 
transformation . . . is relevant only to the 
question of what the passive transformation 
is a transformation of. . This argument 
says nothing about the relation between the 
kernel grammar and simple declarative 
sentences [p. 484; emphasis ours]. 


2. The only argument he can recall in 
favor of an abstract underlying constitu- 
ent structure (UCS) for simple declara- 
tives is the treatment of discontinuous 
elements; he claims this is an artifact of 
an arbitrary decision about the form of 
grammar. 

"Whether or not a phrase-structure 
(kernel) grammar can permit discon- 
tinuities is a technical question about the 
form of such grammars [p. 484; emphasis 
ours]." In particular, our claim that such 
constructions as be chasing are evidence 
for the distinction between manifest and 
underlying order is spurious: "These are 
of the А... ш form [p. 485]." Braine sug- 
gests earlier that such discontinuities can 
be treated by a phrase-structure rule, 
XA... fa where it is a formal con- 
vention that д permutes with the next 
rightmost element. For example, this se- 
quence of phrase-structure rules would 
give the phrase be chasing as an output 
of the underlying phrase structure: 

VP aux V; aux be... ing; V? 
chase. 

A grammar which forbids representing the 
constituents in the form фе... ing 
must necessarily generate [the order] be + 
ing + chase [and we presume John + sg + 
pres + run] and... having generated the 
items in the wrong order, it is of course 
necessary to permute them into the actual 
order. The entire argument associated with 
[these structures] rests on a particular choice 
of phrase-structure model [p. 485; emphasis 
ours]. 

3. Braine then laments the alleged Jack 
of basis for any decisions between "com- 
peting" phrase structure and transforma- 
tional solutions of descriptive problems in 


syntax. 


If the terminal strin 
phrase structure are p! 
trarily different from any а 


gs generated by the 
ermitted to be arbi- 
ctual sentence 
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there are no independent data 
against which the phrase structure and the 
transformational rules can be separately 
tested . . . the grammarian can write the 
phrase-sentence kernel partly on the basis of 
his convenience, free to correct any poor fit 
with the manifest structure of the language 
by using transforms to reshuffle elements. 
This methodological looseness makes it im- 
possible to accept empirical claims about the 
properties of phrase-structure grammars of 
natural languages [p. 485; emphases ours]. 


structures 


4. Braine claims, in summation, that we 
did not show that the simple declarative 
sentence has a distinct abstract form un- 
derlying it, and he asserts that to show 
it we... 


would have to show that there are no 
feasible forms of phrase-structure model 
which would permit transformational gram- 
mars to be written so that only the identity 


transform intervened between the phrase 
structure and the morphophonemic rules 
[p. 485]. 


Our replies can in some instances be 
quite brief. 

1. Braine's argument on page 484 is 
simply wrong. On grounds of simplicity, 
any argument which shows that the un- 
derlying constituent structure for one sen- 
tence type is generated by a particular 
kind of phrase-structure rules is prima 
facie an argument that all sentence types 
have such rules underlying them. А 
grammar does not treat certain sentence 
types to the exclusion of others, and there 
is no reason to believe that a child does 
either. 

2. It is difficult to understand what 
Braine can mean by the claim that the 
explanation of discontinuities is a “tech- 
nical question.” He correctly assesses the 
literature as showing that linguistic for- 
malisms could provide either phrase- 
structure or transformational solutions. 
When nonequivalent explanations of the 
data compete, we must surely decide be- 
tween them by asking which formalism 
accounts for the facts under consideration 
with the greatest economy and generality. 
1f this makes the question of how discon- 
tinuous constituents are to be explained 
“technical,” then all questions of scientific 
theory are technical, since considerations 
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of simplicity and generality underlie the 
solution of all such questions. А 
It is true that the use of discontinuous 
phrase-structure expansions could ade- 
quately generate number agreement and 
appropriate placement of the affix ing in 
the simple declarative sentence. But that 
is all it could do. The arguments in our 
critique showed that for other construc- 
tions (e.g. passive, negative passive, 
question passive, cleft-sentence passive, 
negative-question passive, etc.) the affix- 
movement rule must follow the passive 
rule, which itself follows the development 
of the base structure. Thus, the affix- 
movement rule needed for these construc- 
tions cannot be in the phrase structure 
and is consequently a transformation. 
Indeed, since the same transformation is 
formally capable of accounting for num- 
ber agreement and affix permutation in 
the simple declarative as well as other 
constructions, we have a clear choice be- 
tween two types of grammar. One gram- 
mar uses a discontinuous phrase-structure 
rule for number attachment in the de- 
clarative and a transformation for number 
attachment in all other constructions, The 
other grammar employs only the trans- 
formation. Clearly the latter grammar 
must be preferred since it explains the 
facts with less duplication and with a 
more restricted form of grammar, 
Of course the facts might have shown 
that the discontinuous phrase-structure 
rules are the appropriate linguistic analy- 
sis. To show this, the facts would have 
to be (at least) that no sentence construc- 
tions require a transformational rule for 
affix movement which can also be utilized 
by the simple declarative. That is, the 
transformational solution could have been 
invalidated by the character of English, 
and thus the decision to reject the dis- 
continuous phrase-structure solution is 
not based on the “grammarian’s conven- 
ience," but on the facts of the language. 
3. It is hard to imagine what the “inde- 
pendent data" might be that could “sep- 
arately" confirm any linguistic analysis 
since the theoretical decisions themselves 
are based on all presently available em- 
pirical considerations extractable from the 


language. 


Perhaps we can additionally support 
the decision to treat number agreement 
(and affix-movement phenomena) with 
one transformation by an appeal to the 
reader's intuition about some previously 
unconsidered facts. Is it not the case that 
the subject-verb number agreement (and 
formation of be + ing) is the same sort 
of relation in these two sentences? 

They were running. 

They were being served. 

The analysis proposed by Braine gener- 
ates these two subject-verb agreements 
by two distinct kinds of processes? and we 
might expect that if this analysis were 
true, different intuitions about the nature 
of the agreements could be informally or 
experimentally extracted from native 
speakers. In fact, we have no reason to 
believe that such differences exist, Num- 
ber agreement seems to be a psycholog- 
ically unitary process in English. This 
is reflected in the formal analysis to which 
the linguistic facts led us. 

4. In short, although there is probably 
a “feasible” form of grammar in which 
the underlying and derived constituent 
structures of simple declaratives would be 
identical, this analysis is blocked for Eng- 
lish on empirical grounds. We have shown 
briefly that at very least such a solution 
would involve an unnecessarily complex 
system since there is a simpler analysis 
Which accounts for the relevant facts. In 
this way we show that Braine's claim 
that there is a "base" or “kernel” grammar 
Producing all and only simple declara- 


tives to be logically possible but empiri- 
cally unacceptable, 


THE Morar 


We have argued that there is strong 
empirical Support for the claim that de- 
claratives have abstract underlying struc- 
tures and that this support derives pri- 
marily from considerations of the sim- 
plicity and generality of grammatical 
tules. In conclusion it may be pointed 
Out that, though it is true that a suitable 
complication of the grammar would per- 
mit Braine to treat simple declaratives 


? Discontinuous phrase structure for the 


first, transformational for the second. 
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as the only untransformed structures, 
it must be added that similar compli- 
cations would permit that treatment for 
any other sentence type. There is no 
more reason for complicating the gram- 
mar in order to render declarative sen- 
tences uniquely nontransformational than 
there is for complicating the grammar in 
order to render passives, questions, nega- 
tives, imperatives, etc. uniquely nontrans- 
formational. The difficulty with playing 
fast and loose with simplicity constraints 
is that, once having started, it is hard to 
find a way to stop. 


Tue ExcLUSION OF TRANSFORMATIONS 


Braine states that the ©... main purpose 
[of my discussion of transformations] was 
to show that [their] existence did not con- 
tradict my proposals concerning simple 
declaratives [р. 484]." And later (p. 486), 

my [proposal] specifically excluded the 
learning of transforms . . - my proposals 
could capture the structure of passives 
only insofar as they are similar to predi- 
cate phrases with adjective heads (cf. 
The kazoo was blown by the child; The 
coffee was hot on the stove.)” Further- 
more he states that the fact that many 
transforms appear similar to simple de- 
clarative sentences explains why it is rea- 
sonable to assume that simple declaratives 
are learned separately by the child, even 
thought there is no evidence that they pre- 
ponderate in the child's verbal environ- 


ment: 


I did not claim that children were mainly 
exposed to simple declaratives. - - - Infor- 
mation about positional and contingency rela- 
tions in simple sentences is abundantly exem- 
plified in many transforms «++ consider the 
similarity in verb-phrase structure between 
passives and sentences with adjective predica- 
tors (е. George was served —Gcorge quas 
sensible). - - > i with the ver 

f one sentence type could hardly 
fail to trans 


fer to the other type [P 483; 


This g 1 
volve Braine in a hopeless dilemma. 19 
wit, ot exclude trans- 
formations. objections 1n the 

+ this reply obtain; if the 


critique and 1n y 
child can isolate and reject transforms 
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from consideration, then he must already 
know the grammar including its trans- 
formational component: Transforma- 
tional information is required to distin- 
guish between declaratives and other types 
of sentences. 

This dilemma runs deep. 
cannot yet tell whether an apparent de- 
clarative sentence actually is one, how 
does he know when to transfer experience 
from one putative declarative to another? 
Tt is undeniable that many sentences other 
than declaratives appear in the declarative- 
like form “NP VP.” How does the child 
know which such sentences are sufficiently 
“like” true declarative to permit relevant 
transfer of training unless he already has 
transformational information? How can 
“experience with one type . · - transfer 
to another" sentence of the same type un- 
less the child can see that they are of the 
same type? That is, he must have the 
transformational information required to 
distinguish between real and apparent 
declaratives. Furthermore, how does the 
child know what kind of experience to 
generalise? Surely if everything the 
child knows about 

George was served. 
generalized to 

George was sensible. 
the child should expect the sentence 

*Somebody sensibled George. 


to parallel 
Somebody served George. 


If the child 


NATION OF HomoLocous 
SEQUENCES AND “COVERT CATEGORIES” 
Several different problems for Braine’s 
earlier and present proposals are man- 
ifestations of Braine’s inability to answer 
the question : How are sentence types dif- 
ferentiated when they have similar word- 
class orders! (See our critique for 


examples. 

Braine says, «|, the kind of phrase- 
structure system for which my proposals 
are adequate is one which permits only 


one anmarked word list at any given loca- 


tion and which restricts the circtimstances 
i marked list can occur at 
homologous locations [р. 87]." 


He proposes that this requirement be met 


Tur DISCRIMI 
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by an “indexing restriction” partially aes 
ried by “function words which index 
classes and distinguish their locations 
[p. 487]."? But, he notes, this is not an 
adequate solution: “©... is a fact that 
languages contain structures in which the 
same position can be occupied by two or 
more classes which are not differentially 
marked. Bever et al. are correct in point- 
ing out that this sort of structure poses a 
problem for my proposals [pp. 487-488]. 

As a solution, Braine invokes the use 
of "covert categories" which he presumes 
to be "semantic." Words that are other- 
wise undifferentiated in a particular sen- 
tence type thus must have an "internal" 
analysis which differentiates them. 

We, of course, agree with this proposal 
since it reduces to the tautology that what- 
ever really is different ought to be so de- 
Scribed. But the tautology is useless un- 
less a theory for providing correct internal 
analyses is also forthcoming, and Braine's 
own discussion suggests how unlikely it is 
that such analyses can be formulated in 
semantic terms. In fact, generative gram- 
mars do provide differing types of words 
in homologous Position with distinct 
analyses; they do so largely by the use of 
transformations which can distinguish be- 
tween items that have similar Privileges of 
occurrence but which differ in the base 
structures from which they derive. Con. 
sider first a nontransformational case— 
proper names (an example proposed by 
Braine) : Presumably there are different 
internal analyses for the lexical items 
“George” and “butter” which allow the 
sentences 

George is nice, 

Butter is nice. 
and the sentence 

The butter is nice. 
but block the otherwise contextually 
generalizable . 

The George is nice. 


з Since Braine's first paper, rules for the 
introduction of lexical items into underlying 
structures have been shown to be substitu- 
tion transformations (Chomsky, 1965 ; 
Matthews, 1963) and not phrase-structure 
rules. His argument at the end of his sec- 
tion, “The Scope of the Proposals,” is there- 
fore unsound. 


But what internal markers differentiate 
the participle in these sentences ? 

Making mistakes can be annoying. 

Recurring mistakes can be annoying. 
What “markers” allow 

Many mistakes were made. 
but block 

*Many mistakes were recurred. 
The analysis which differentiates the par- 
ticiples successfully must refer to the un- 
derlying constituent structure, namely to 
the fact that “make” occurs with an object 
permissible in the UCS (and that "mis- 
takes" is that object) and that "recur 
does not (so that “mistakes” is the sub- 
ject in the UCS).4 

In short, we agree with Braine’s pro- 
posal that words and homologous sentence 
types are differentiated by characteristics 
not directly observable in the sequences 
themselves. In addition we propose that 
differences in transformational derivation 
are necessary to provide the explanation 
for many of these “covert” distinctions. 
So far as we can see, Braine has provided 


no reason whatever for rejecting this 
suggestion. 


THE “Uncrarity 


Braine states: 
in learning tr 
discuss in par 


OF TRANSFORMS” 


“What might be learned 
ansformations is difficult to 
t because the transformation 
concept itself seems unclear [p. 488].” 


he "unclarity" appears in at least two 
general ways: (a) *, 


„+ Notice that this exactly characterizes the 


distinction between “transitive” and “intransi- 
tive verbs, although this distinction is not 
apparent in the SU 


face structure of the 
sample sentences. We could decide that such 
notions as transitive, intransitive, and so on 
should be represented as “covert features” of 
lexical items without reference to the UCS. 
This Possibility js empirically rejected be- 
cause it would not eradicate an abstract 
level from the Вгаттаг, and it would result 
in duplication in the surface phrase marker 
of the distinctions already made in the UCS. 
It is never difficult to construct a theory 


more complicated than the simplest one 
currently available, 


< 


.anisms сап be foun 
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order to ensure that a subject . . . [learns] 
a transformation [p. 488]"; (b) gram- 
mars are too strong and their character 
will change as they become refined. Rules 
for derived constituent structure are not 
well understood. 

Regarding (a), it seems to us fantastic 
to require that experiments should be de- 
ducible from theories. Regarding (b). 
the questions in grammatical theory which 
await theoretical and empirical elucidation 
do not place in jeopardy the answers 
which we do have. i 


On LABORATORY EXPERIMENTATION WITH 
PRIMITIVE ARTIFICIAL LANGUAGES 


Tn our critique we suggested that ex- 
periments on artificial languages are 
equivocal with respect to natural lan- 
guage, Tt is inconclusive to show that 
the subjects learn the intended structure 
when exposed to a relatively small num- 
ber of instances either in artificial or 
natural language since the structure that 
must be assumed for natural languages is 
determined by the need to supply a simple. 
uniform treatment for a large variety of 
sentences of very specific formal char- 
acter. Finally there is no reason to be- 
lieve that simple phrase-structure artificial 
languages are on a psychological con- 
tinuum with natural language. 

Braine counters in his reply that he has 
other evidence indicating that the subjects 
did learn the assumed structure. He also 
points out that it is silly to assume that a 
child loses his natural language-learning 
capacity when he enters а laboratory. 

Of course we were only questioning the 
experiment’s capacity to bring out the 
child’s ability. There is no support for 
the assumption that the experiment with 
this restricted "language" elicits the 
mechanisms employed in learning natural 
language. Evidence that these languages 
do not elicit first-language learning mech- 
d in the recent work of 
They repeated Braine's 
h adults and obtained 

This means 


results. 
learn languages in the 


r that the experi- 


"Wales and Grant. 
Experiment TIT wit 
exactly the same 

either that adults 
same way as children o 


5 Wales, personal communication. 1965. 


ment is not tapping the language ability 
indigenous to the child. 


Tue CONFLICT BETWEEN LINGUISTIC 
ANALYSIS AND CURRENT 
LEARNING THEORY 


In our critique, we suggested that cur- 
rent models of learning are in fact incom- 
patible with the empirically based results 
of linguistic theory. Braine is sensitive 
to this, and he implicitly uses it as an 
argument against certain features of lin- 
guistic analysis; in particular, he is both- 
ered by the concept of an “abstract” un- 
derlying constituent structure for which 
no explicit representation appears. 


The distinction [between manifest and under- 
lying structure in kernel sentences] is cer- 
tainly not convenient psychologically : The 
assumption that the learner learns structures 
to which he is never exposed would indeed 
pose “serious problems for any theory of 
syntax learning [р. 486]." 


‚. . the fundamental issue at stake in this in- 


terchange [is] whether the order of elements 
ts the order in the 


in simple sentences reflec 

underlying string. . - - Bever et al. are argu- 
ing that 

. . . the learner is often never exposed to the 
pattern properties which must be assumed to 
be learned. Before accepting this argu- 
ment, I would urge that psychologists ex- 
amine the linguistic evidence for a distinction 
between manifest and underlying structure 
in kernel sentences very closely indeed 


[p. 490]. 

We of course agree. and we have tried 
to expand and clarify the empirical basis 
for the decision (a) that all sentences 


o us unlikely ол апу account 


6 Jt seems t Ў 
that adults learn languages 1n the same 
children do. This does not 


manner as 
depend on а 
language lea 
reject the poss 
tion" explains t 
guage learning. 


n assumption of "innateness" for 


rning in children, nor must it 
ibility that “proactive inhibi- 
he adult’s difficulty with lan- 
Whatever view you hold, 
it is obvious that an adult with 20-years 
X will learn. Lan- 


experience in Language . 
guage Y differently from a child that has 
had only tive experience im 
X. The fact that children and adults yield 
identical experimental results thus indicates 
that the experiment is not stimulating first- 
language learning mechanisms in children. 


2- or 3-years ac 
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have an abstract structure underlying 
them and (b) that this structure is not 
isomorphic with the simple declarative 
sentence. The essential themes of our 
discussion have been: (a) Related parts 
of a language cannot be considered in 
isolation; (b) decisions concerning the 
form of grammar are based on determin- 
ing which formalism explains the facts of 
the language with the least duplication 
and the most generality. 

The psychological role of the underly- 
ing constituent structure has not been 
carefully studied except insofar as it is an 
integral part of the account of gram- 
matical sentences themselves. Though 
linguistic evidence of this latter sort is 
not in any sense arbitrary or equivocal, 
we do feel that the exact form of the em- 
pirical extensions of the underlying con- 
stituent structure into other aspects of 
language behavior requires thorough psy- 
chological investigation. Part of that 
study will no doubt include experimental 
effects of UCS upon the recall, perception, 
or learning of sentences. A crucially im- 
portant part of that study must center on 
how the child goes about learning struc- 
tures for which there are no explicit 
models. 


There are those who agree with Braine 


that this is impossible either in fact or in 
principle. As the empirical basis for as- 
suming an abstract underlying structure 
in language becomes broader and the ex- 
planatory power of that assumption be- 
comes deeper, we recommend to all psy- 
chologists that they seriously question the 
adequacy of any theory of learning that 
cannot account for the fact that such 
structures are acquired. 
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ECOLOGICAL OPTICS AND VISUAL SLANT* 


ROBERT B. FREEMAN, JR. 


Pennsylvania State University 


Flock's “А Possible Optical Basis for Monocular Slant Perception” is 
criticized as being a theory of stimuli rather than a theory of per- 


ception. 


To account for accurate monocular slant perception, the 


theory requires 9 assumptions, including the unproved ability of the 


eye to register random texture density. 
osed that monocular visual slant is a function primarily of 


is prop 


contour perspective which varies with the si 


The alternative hypothesis 


ze, shape, and viewing 


distance, as well as slant, of plane surfaces. 


Flock (1964a), in a recent paper, 
elaborates on Gibson's (1950, 1959, 1961) 
gradient concept which relates visual 
ecology to retinal stimulation. Assuming 
that “slant perceptions .. . depend on 
optic variables [p. 391]," Flock presents 
an analysis of "optic variables" arising 
from textured, slanted surfaces. The ar- 
guments of this note are that the principal 
part of Flock’s theory is not a theory of 
perception but a description of visual 
stimuli, that the casually mentioned “abili- 
ties” of the eye to register the proposed 
visual variables of surface slants involve 
an inordinate number of assumptions for 
what they accomplish, and that the visual 
variables chosen for description in the 
paper have already been experimentally 
demonstrated to be ineffectual and un- 
necessary for the perception of slant. An 
alternative stimulus for visual slant is 


suggested. 


EcorocicAL OPTICS OF SLANT 


The purpose of Flock's paper is to show 


how 


ular slant perceptions are 


.. accurate monoc р i 
otionless viewer 


possible even though a motic 
has no previous experience with a particu- 


lar substance, even though the textural ele- 
ments of a motionless surface are irregular 
in size, shape, and separation, „and, ее 
though parts of the surface are illuminate 
differently [Flock 1964a, p. 380; italics, 


mine]. 


1 This note was written in connection wi 
research supported by Grats Unit 4 States 
National Institutes of Health, Unite 


Public Health Service- 
501 


Flock attempts to accomplish this goal 
by describing a trigonometric trans- 
formation of surface elements which re- 
mains invariant with variations in stimu- 
lus conditions other than physical slant. 
His analysis is primarily an ecological de- 
scription of the optics of slant: a descrip- 
tion of the transformations which light 
reflected from real, planar surfaces under- 
goes in projection as it converges on the 
observer's eye. 

His analysis is based on two postulates. 
Postulate I states that “substances” (pre- 
sumably surfaces) of a certain class pos- 
sess a unique pattern characteristic of 
Postulate II states that such 
e elements” which 
shape, and separa- 
tion. Flock thus allows for variability in 

ariability is 
viding the textured 


elements. 
larger the unit, an 
n, the greater the regularity.? 
explained how the eye can register such 
a unit of # like elements from among à 


? An equation is presented for "degree of 
regularity" of like elements which has some 
peculiar properties. Regularity 15 made an 
inverse function of the variability of ele- 

i ion of both 


ment size о | 
average size and total number of like ele- 
ments. Equation 4 has the result on stim- 

elements of very high varial ility can 
€ arity if there 


high degree of regul 
have а "E hem. Conversely, if the л 


are enough of tl 5 | 
is small compared to 5, negative regularity 


can result. 
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continuously variable series of like 
"S then attempts to make "optical 
slant” a function of the relative Ne gone 
angles of three such linearly E 5 
which are separated by equal visual an- 
gles perpendicular to the axis of appar- 
ent rotation. This is where Flock 8 opti- 
cal theory of slant encounters its greatest 
trouble. It can be easily shown that the 
angular subtense of any three distances 
of equal length parallel to the axis of 
rotation of a plane surface changes in a 
complex fashion with rotation of the sur- 
face. This fact suggests that the texture 
gradient at the eye arising from elements 
scattered about a plane surface will not 
be a simple one and will vary with size, 
distance, and other variables, as well as 
slant (cf. Freeman, in press—b). 

But, assuming that slanted, pl 
faces are perceived as sl 
faces at their true slant, Flock argues that 
there must be something in the stimulus 
situation which yields Such a veridical 
perception. His Equation 5 represents 
an effort to turn a complex stimulus into 
a simple one so that the eye can have a 
Simple percept, Equation 5 expresses the 
variation in visual angle subtended by a 
single unit of "optical л” with variation 
of visual slant (“optical theta”) of the 
distal surface, Although the derivation 
of Equation 5 is not made available, it 
has the Property that optical theta is in- 
variant with the number of optical units 
involved in directions both parallel and 
perpendicular to the axis of rotation, 
Thus optical theta must be unrelated to 
the size of the plane surface, This re- 
quirement makes necessary a very com. 
plex equation of a form unknown to psy- 
chophysics. As will be Shown below, 
such a complex function is unnecessary, 


FLock's PSYCHOPHYSICAL ASSUMPTIO 


anar sur- 
anted, planar sur- 


NS 

Flock has argued for a potential visual 
stimulus for slant in the manner in which 
Gibson (1961) has presented a general 
description of “ecological optics” of the 
visual environment. But in presenting a 
potential visual stimulus, he must also 
postulate a potential sensory system to re- 
spond to the potential slant stimulus or 
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optical theia. To do so, nine "3 она 
are required (Flock, 1964a, pp. 382, 389), 
the last of which is that the visual sen 
registers texture in a manner pres 
to Equation 5. Among other things, the 
eye is required to choose Hn айтор ше 
optical п for each optical unit according 
to an additional set of four rules. (Flock, 
1964a, p. 384). With 13 assumptions = 
criteria involved, Flock’s psychophysica 
theory of optical slant is complicated in- 
deed. No suggestions are made as to 
how the visual system might be expected 
to bring about such a complicated analysis. 


NEGATIVE EXPERIMENTAL EVIDENCE 


Aside from the theoretical complexities 
of optical theta there are several өеп: 
mental objections to the theory. Floc! 
(1964b) himself has already provided evi- 
dence that the optical registration of mo- 
tionless texture is poor, In his Experi- 
ment IX, using the Gibson-type shadow 
caster and an electrostatic texture, sub- 
jects adjusted a protractor to indicate the 
apparent slant of a 69° field of texture 
gradients corresponding to nine different 
physical slants at 10° intervals from 
— 40° (top toward the subject) to + 40 
(top away from the subject). Flock re- 
grettably gives the results in terms of 
regression of the subjects’ judgments of 
slant on the slant of the shadow caster 
rather than plotting the former as a func- 
tion of the latter in the usual fashion. 
But these data are sufficient to show the 
poor psychophysical correspondence 1n- 
volved, insofar as the mean regression со- 
efficient in Experiment IX was only .13 
as compared to 1.12 in Experiment VIII 
in which motion-parallactic cues were 
available with the identical stimulus situ- 
ation. The gradient arising from random 
texture at a slant is an inadequate stimu- 
lus for visual slant for the motionless 
Observer, a 

If texture gradients are not effective 
stimuli for slant, what other proximal 
stimuli are available? An earlier experi- 
ment (Clark, Smith, & Rabe, 1056) had 
compared the relative effectiveness of tex 
ture gradients and “outline gradient,” or 
Perspective. In their report, Condition B 
was a randomly textured surface slanted 


—— — 


THEORETICAL NOTES 503 


at 40° viewed through a 6-centimeter re- 
duction hole, while Condition C was a 
textureless rectangle (with the same 
solid-angle area as the hole in В) on a 
black background, also at 40° slant. 
Judged slant in C was significantly 
greater than in B.  Judged slant in 
Condition E, which combined texture 
and outline perspective, was not sig- 
nificantly greater than in the texture- 
less-rectangle condition. According to 
the results of Clark et al. (1956), out- 
line perspective has а significantly 
greater effect on judged slant than tex- 
ture gradients. And when combined. 
outline dominates as a visual stimulus. 
Another experiment (Gruber & Clark, 
1956) varied both size and texture den- 
sity of random-dot patterns as well as the 
distance of the observer. Observers 
greatly underestimated the three different 
surface slants used. Furthermore, both 
surface texture and observation distance 
had significant effects on judgments. 
These results are clearly inconsistent with 
the concept that "accurate monocular 
slant perceptions are possible even though 

. the textural elements of a motion- 
less surface are irregular in size, shape, 
and separation . . - [Flock, 1964a, p. 
380]." 

Finally, prompted by a finding by Stav- 
rianos (1945) that the judged slant of 
rectangles of constant shape varies with 
their size, this writer (Freeman, in press 
—a) conducted a parametric study to de- 
termine whether the size effect could be 
attributable to outline perspective. Tex- 
tureless rectangles of sizes ranging from 
8-40 centimeters in length were viewed 
monocularly under complete reduction 
conditions at a distance of 135 centi- 
meters. With a 24-centimeter reference 
rectangle, equal-slant contours were ob- 
tained for five slants (15°, 30°, 45°, 60°, 
and 75°), both forwards and backwards. 
The size effect on judged slant was large 
and highly significant in most of the 
Since outline perspective of 
slanted rectangles varies with the width 
and probably height, as well as physical 
slant, of the rectangles, the size effect ap- 
pears to bea function simply of projective 


outline shape. 


curves. 


CoNTOURS—PRINCIPAL STIMULI 
FOR SLANT 


In addition to the behavioral studies 
mentioned above, there is mounting evi- 
dence that the vertebrate visual system is 
"tuned" to register contours (abrupt 
brightness gradients). In addition to the 
extensive investigations of Hartline and 
Ratliff (e.g. Ratliff, 1961) on the inhibi- 
tory interactions of the lateral eye of the 
arthropod limulus, there are the discov- 
eries of neural boundary detectors in the 
optic tract of the frog (Maturana, Lett- 
vin, Pitts, & McCulloch, 1960) and in the 
visual cortex of the cat (Hubel & Wiesel, 
1962). The combination of behavioral and 
electrophysiological evidence cited above 
makes ble two general postulates 
which are in disagreement with Flock's 
optical texture gradient hypothesis : 


1. With monocular observation, visual 
shape and visual slant are a function pri- 
marily of linear outline perspective. 

2. The greater the numerical value of 
linear perspective, the greater the judged 
visual slant and the greater the effect on 


judged shape. 


The second postulate implies that appar- 
ent slant will vary with perspective, 
whether or not perspective is a true func- 
tion of physical slant, and with or with- 
out nonlinear (random ) surface texture. 
Since outline perspective varies with the 
shape, size, and distance of a plane sur- 
face as well as with slant, the apparent 
slant of stimuli so varied, when viewed 
monocularly with a motionless head under 
complete reduction conditions, must also 
vary, Flock's optical л notwithstanding. 

The contour hypothesis therefore does 
not predict veridical judgments of slant, 
in the sense of judgments which are con- 
sistent with physical rotation relative to а 
fixed observation position. On the con- 
trary, it says that visual slant will vary 
with variation in the projective character 
of stimulus contours at a slant to the 
visual axis, regardless of the environ- 
mental source of such variation. It is an 
explanation based, not on optical ecology, 
but on retinal stimulation. 1t is, finally, 


504 


a theory of perception, not a theory of 
stimuli. 
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OPTICAL TEXTURE AND 


LINEAR PERSPECTIVE AS STIMULI 


FOR SLANT PERCEPTION * 


HOWARD R. FLOCK 
Dartmouth College 


Experiments criti 
jn evoking accura 
inadequate, or deficient. 


a stimulus for slant are evaluated. 
ly of linear perspective is shown to be over- 


slant is a function primari 


simplified and hardly adequate to cope with the facts. 
Flock’s theoretical model specifying optical stimuli 


discussed. 


In his criticism of my theoretical paper 
on visual slant perception (Flock, 1964a), 
Freeman (1965) claims that variables of 
surface texture are both ineffectual and 
unnecessary for the perception of slant. 
He argues that all perceived slants are 
a function primarily of linear outline 
perspective (contour convergence) and 
that the greater the linear perspective 
the greater the judged visual slant. To 
support his argument he cites two classes 
of empirical evidence which seem to show 
that (a) the bounding contours of rec- 
tangles, not their in-lying texture, in- 
duce perceived slant (Clark, Smith, & 
Rabe, 1956a), and (b) surface texture 
by itself induces considerable underesti- 
mates of slant (Gruber & Clark, 1956; 
Flock, 1964b, Experiment IX), whereas 
textureless rectangles in isolation consist- 
ently induce rather good judgments of 
slant (Freeman, in press). In addition, 
Freeman apparently Was dissatisfied with 


my theoretical model describing poten- 
ived slant. 


tial optical variables for perceive 
(TEXTURELESS RECTANGLES VERSUS 
SunrFACE TEXTURE 
Clark et al. (19562) found that surface 
texture by itself (b = 23; in 1956b, .24)° 


ported by National 
B 2474. 
s of different 


1 This work was sup 
Foundation 


efficients, ^ 
judged slant ove? e 1 
slant employe by the experimenter 


ical of the effectiveness of variables of optical texture 
te judgments of slant are shown to be inappropriate, 
Experiments supporting linear perspective as 


The contention that perceived 


Some aspects of 
for slant are 


is a less-effective condition for accurate 
slant judgments than is a rectangle by it- 
self (b = .34 and 30; in 1955, 42 and 
.44). Moreover, adding texture to а 
textureless rectangle does not change the 
judgment of slant (b= 36). These 
studies are faithfully described by Dem- 
ber (1961) who seems to accept the gen- 
eral conclusion in favor of outline con- 
vergence. More recently Smith (1964, 
Conditions A and B) replicated the 
earlier findings and again found that 
textured and textureless rectangles do 
equally well (b = .67 and .68). 

Using Clark et al's (1956a) arrange- 
ment of apparatus, Gruber and Clark 
(1956) tested the effects on slant judg- 


ments of surface texture in isolation. 


They varied distance of surface, diameter 
dots), and 


of texture elements (white 

separation of texture elements. Regres- 
sion coefficients based on extrapolations 
from a partial graphic report of their 
data varied between .18 and .43 depending 


on distance, dot density, and dot size. 


More recently Epstein (1962), Epstein 


and Mountford (1963), and Freeman (in 
press) have found that small, textureless 


In the many cases where E used just one 
physical slant, 0° slant was added gratu- 
jtously on the assumption that had the surface 
been ‘at 0° slant, subjects’ (Ss) judgments 
would have been centered оп 2, When 
there is an identity relation between the 
levels of judged and physical slant, b = 1.0. 
and the judgments d to be OP- 


are assume! 
timal. When ь=0, judgments are at à 
chance level. 
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rectangles under complete Spor 
ditions yield almost perfect ig a 
ments.  Epstein's experiments boys кэ 
tions NR) yielded coefficients of 
69. Freeman’s experiments yielded а 
total of 45 coefficients of e md of 
mean judged slant over 10 levels of p ue 
ical slant, varying from .67 to 1.17 ^w а 
near-perfect mean and median of .99. 

At the superficial level that I have de- 
scribed these data, it would seem that 
there might be some basis for Freeman's 
(1965) claim. But let us look more 
closely at these experiments cited by 
Freeman. 


TABLE 1 


Some relevant data about these experi- 
ments are given in Table 1. The experi- 
menter (E) and experimental conditions 
are identified in Columns 1-3. The sym- 
bols T, R, and T-R in Column 4 indicate 
that the display was composed of an ex- 
tended (edges occluded from view) tex- 
tured surface (Т), of a textureless rec- 
tangle (R), or of a textured rectangle 
(T-R). Columns 5-8 give for each ex- 
periment the total angular diameter of the 
field of view (V) in degrees, the distance 
(D) in meters between the center of the 
display and the eye, the diameter (E) in 
millimeters of the texture element, and the 
separation (S) in millimeters from cen- 
ter to center of adjacent texture elements, 
The symbol 0, in degrees, indicates the 
actual slants used in an experiment. The 
symbol o gives the angular height in de- 
grees of the rectangle when at 0° slant, 
measured along the meridian perpendicu- 
lar to the axis of rotation. The column 
labeled By-Bp provides а rough measure 
of linear perspective, Bx being the angu- 
lar width in minutes of arc of the slanted 
rectangular edge nearest the eye and Вр 
in minutes of arc of the edge farthest 
from the eye when the rectangle was at 
the most extreme slant used by that E. 
Columns 12 and 13 give the angular 
widths, 8, in minutes of arc of the near- 
est (subscript N) and farthest (subscript 
F) texture element from the eye. These 
angular widths (sizes) were measured at 
the meridian that bisects and is perpen- 
dicular to the axis of rotation when the 
surface was at the most extreme slant 
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used by that E. The regression змеј 
cient is given as b (see Footnote 2 above ). 

Except for Freeman’s and Epstein's 
experiments, which will be discussed и 
By-Br for the cited experiments varie 
from 24 to 43 minutes of arc. More- 
over, the angular sizes of the cane 
in these experiments varied between б. 
and 9.8°. Consider the comparable data 
for the surface textures that were tested. 
For the cited experiments 8y-8p varied 
between 16.8 seconds of arc and 3.8 
minutes of arc. Moreover, the angular 
sizes of the dot elements, 8, were never 
greater than 16.7 and were as small as 
1.6 minutes of arc. 

For these experiments, can we be sure 
that the angular changes in the texture 
elements over a slanted surface as well 
as the texture elements themselves were 
supraliminal for Ss? After all, angular 
changes of .28 minute-3.8 minutes of arc 
and even angular white and black dots of 
1.6-16.7 minutes of arc would be expected 
to be subliminal under a variety of view- 
ing conditions for even a normal HI. 
lar eye. Despite that, not a single 
cited in Table 1 gave a solitary clue about 
the visual acuity of his Ss. Except oe 
Smith (1964), not a single E who teste 
textures reported any luminance measures 
for dots and backgrounds. Smith (1964) 
reported the luminance of the white back- 
ground but not of the black dots he used. 
It is not clear, therefore, that there was a 
discriminable change in the perspective of 
surface texture in any of these experi- 
ments, 

That Freeman could have overlooked 
this crucial question is surprising. For 
one thing, not only did I discuss the ques- 
tion of surface texture and visual acuity, 
but I also predicted what Table 1 tends 
to reveal: that as a term like 8x-8r ар- 
proaches liminal values, surface texture 
will be increasingly ineffectual in elicit- 
ing good slant judgments (Flock, 1964a, 
р. 386). Freeman’s lapse here is sur- 
prising for a second reason. In his own 
Work he attributed the failure of his 
first experiment to the use of overly 
Small rectangles, even though for i. 
rectangles By-Byp was as large as 7 
minutes of arc. He complained that 
"whatever cues were present in . ch 


THEORETICAL NOTES 


stimuli must have been very weak.” 3 
According to Freeman there was no dis- 
criminable information in a perspective 
change of 7.7 minutes of arc when the 
stimulus object was a rectangle. Why, 
then, would he expect that there would be 
discriminable information when the stim- 
ulus object was a surface texture and the 
perspective change was as small as .28 
minute-3.8 minutes of arc? 

Тє is true, then, that in these studies 
surface texture has not elicited very ac- 
curate judgments of slant. But each in- 
stance of failure occurred under pre- 
sumably near-liminal, liminal, or sub- 
liminal conditions, involving angularly 
small dot elements, angularly small 
changes in dot size, angularly small 
separations, low dot density, or short ex- 
posure and extreme variability of the ele- 
ments of texture as in Flock's (1964b) 
Experiment IX. Experimental studies 
were available in which conditions were 
more favorable for surface texture, but 
Freeman seems to have overlooked them. 
For example, in the paper that he criti- 
cized (Flock, 1964a) there were at least 
three such references (Flock, 1962; Flock 
& Moscatelli, 1964; Gibson, 1950). 

How effective is the linear perspective 
of a slanted, textureless, rectangular sur- 
face as a stimulus for slant? Is it as 
effective as Freeman implies? Except 
for  Epstein's (1962), Epstein and 
Mountford's (1963), and Freeman's (in 
press) experiments, the untextured rec- 
tangular surfaces have been between 6.3? 
and 9.8? in height. For these rectangles 
all but one of the regression coefficients 
were between .30 and .44, the exception 
being Smith's (1964) coefficient of .67. 
Curiously, that result of .67 was greater 
than anything that Smith (1956, 1959) or 
his colleagues had found earlier, even 
though the experimental conditions were 
similar in most relevant respects. Tt 
would seem, therefore, that supraliminal 
6°—10° rectangles at slants up to 40° and 
50° will reliably yield a coefficient be- 
tween .30 and perhaps 45. Та contrast. 
threshold. conditions for surface texture 
seem to yield coefficients between | .22 
and .30. Thus, under relatively optimal 
n was made in an 


зт ted ех lanatio 
bé quoted pe 1964, p. 17). 


ea: ser edition (Freeman, 


509 


conditions textureless rectangular sur- 
faces perform only slightly better than 
do surface textures under threshold con- 


ditions. 
FREEMAN'S AND EpsTEIN's EXPERIMENTS 


In Table 1 compare the results of 
Freeman and Epstein with all other Es 
who have tested the slant-inducing effects 
of textureless rectangular surfaces. Ep- 
stein and Freeman, using smaller rec- 
tangles than the other Es (see a in Table 
1), get markedly better slant judgments. 
It is as if the smaller the rectangle, the 
better slant judgments became.* There 
are some grounds, however, for disre- 
garding the findings of both of these ex- 


perimenters. 

Freeman’s (in press) data are, I be- 
lieve, artifacts of his experimental 
method. Freeman predicted that if a 
standard (ST) slanted rectangle was 
smaller than a comparison (CO) rec- 
tangle, then when S adjusted the CO so 
that both would have the same perceived 
slant, 5 would underestimate the slant of 
the ST. Moreover, he predicted that the 
greater the discrepancy in size, the 
greater the underestimation. Conversely, 
if ST > CO, then S would overestimate 
the slant of the ST.  Freeman's data 
from his second experiment purport to 
confirm this hypothesis. There are prob- 
lems, however, in accepting this interpre- 
tation. Freeman presented ST and co 
monocularly in two chambers of a tachis- 
toscope, at time intervals of ST — 1 sec- 
ond, rest = .7 second, CO=1 second. 
Both ST and CO appeared in the center 
of S’s field of view. E changed the slant 
of the CO in 2° steps between trials un- 
til, in effect, S reported they were at the 
Tt is not clear, however, 
that any Ss had to see rectangles at à 
slant in order to confirm Freeman's hy- 
pothesis. Tf CO was adjusted so that its 
projected trapezoidal image tended to be 
similar to that of the ST and so that 
the two were more or less centered in the 
field of view, one could predict all of 
Freeman’s curves. In other words, Ss' 


same slant. 


4 There is a nice irony here. Freeman's 
(in press) experimental thesis sought to 
prove that slant judgments became greater 
as rectangles were made larger. 
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tasks as they accepted them шау have иеа 
to match trapezoids for similarity, кла 
than slanted rectangles for slant equality. 

That Freeman's (in press) ша аге 
artifactual is indicated by his disc osüre 
that some of his $s in the m Lis 
periment reported that ' the stimi = : = 
appeared slanted to them. Nev erthe p У 
25 of the 27 regression coefficients for 
Ss in the first experiment lay between 
.83 and 1.17, the other two being .61 and 
67. Since Ss said they could not see the 
slant of the rectangles, they must have 
produced these remarkably good results 
in some artifactual way as I suggested 
above. Moreover, if Ss could artifactu- 
ally produce such remarkably good re- 
sults in the first experiment, could they 
not artifactually produce the same good 
results in the second experiment? The 
second experiment differed from the first 
in that larger rectangles were used, The 
results of the second experiment were 
very similar to the first, 17 of 18 coeffi- 


cients lying betwen .81 and 1.14, the 18th 
being .76. 


There are other curiosities 
man's (in press) article. 
l-centimeter rectangle was at 
he says that the angle s 
nearest edge minus the 
by the farthest edge wa 
arc. Freeman says tha 
below the limits of vis 
presumably, therefore, one would expect 
that Ss’ judgments of slant should be 0°. 
In fact the mean judged slants for the 
1-centimeter rectangle at 60° slant were 
32°, 41°, 63°, and 58°, of which three 
are near approximations of the correct 
slant. Should not Freeman have ex- 
plained how his Ss managed this re. 
markable feat? (When the rectangles 
were very small, Ss were told the direc- 
tion of their slant and were both told and 
shown their true shape and Size. Ss 
could then use the ratio of the angular 
projections of the slanted height to the 
unslanted width as a clue to the rectangle's 
slant.) А 

Epstein's (1962) and Epstein 
Mountford's (1963) data should also not 
be taken too seriously. Epstein. photo- 
graphed a textureless rectangle, size un- 

specified, from which he made an Ekta- 


in Free- 
When the 
60° slant, 
ubtended by the 
angle subtended 
S 2.5 seconds of 
t this was “well 
ual acuity”; and 


and 
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chrome transparency, image size unspeci- 
fied. The calculations in Table 1 are 
based on the assumption that, trans- 
parencies being about 1 inch, the image 
height on the transparency was at least 
1 inch (although it might have been less). 
Since the rectangle was the size of a 
playing card, one might assume that the 
image width was .8 inch, On the basis 
of these assumptions Bx-Bp for slants 
of 159, 30%, 45°, and 60° was 20, 3; 
55, and .65 minutes of arc, respectively. 
Despite these small perspective changes 
mean judged slants for the four slant 
levels were 15°, 30°, 33°, and 44° (Ep- 
stein, 1962) and 20°, 29°, 31°, and 54 
(Epstein & Mountford, 1963). These 
performances were so remarkably good, 
one wonders whether artifacts were not 
present in these experiments also. 

On grounds other than these, however, 
it may be desirable not to take Epstein's 
data seriously, For part of his 1963 study 
with Mouritford he used trapezoids in a 
frontal plane as STs, comparing the slant 
judgments they induced with those in- 
duced by physically slanting the trans- 
parencies. His trapezoids were wholly 
incommensurate with the slanted rec- 
tangles, however, The trapezoids were 
Projections of much more extremely 
slanted rectangles than of the slanted 
rectangles with which they were com- 
Pared. Although he did refer to these 
as “slight” discrepancies, that fact did 
not dissuade him from concluding that 
frontal trapezoids induced better slant 
judgments than the slanted rectangles of 
which they were Purported to be the pro- 
jection. Nor did it persuade him to dis- 
card the data and Start over again. 
(Smith, 1964, for example, attempted 
and failed to replicate his findings.) 

Gruper AND CLank's EXPERIMENT 

Gruber and Clark's (1956) experi- 
ment (see above and Table 1 for details) 
purports to be far more critical of my 
theoretical position than for the reason 
given by Freeman (1965, p. 503). Gruber 
and Clark report only part of their data, 
in the form of four curves (their Figure 
2). The relative position of these curves 
acquired great significance for Gruber 


E a U — ——— - ——————_ 
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and Clark and later for Eriksson (1964).5 
Gruber and Clark theorized that judged 
slant depended on a cortical or retinal 
interaction unrelated to measures of visual 
acuity. They predicted that dots which 
are close together or far apart will yield 
slant judgments that are worse than for 
dots which are at interim separations. 
Moreover, they predicted that there would 
be increasingly accurate slant judgments 
as one went from extremely small or 
extremely large to moderate separations. 
In their second experiment they tested 
and confirmed. these predictions. 

Consider the facts, however. Except 
for their viewing distance of 1.5 meters 
the total fluctuation over all their mean 
judged slants was apparently less than 
6.5? (extrapolating from their Figure 2). 
This range of 6.5? determined the relative 
positions of their four curves. Their 
display of 3-millimeter dots with 8-nilli- 
meter separations at 1.5 meters yielded an 
apparent mean judged slant of 10.1°, 
whereas their display of 6-millimeter dots 
with 16-millimeter separations at 3.0 
meters yielded an apparent mean judged 
slant of 14.5°, a difference of 44°. These 
two displays were optically identical. Tf 
under the same conditions the same Ss 
vary 44°, can one seriously entertain in- 
terpretations of four curves based on a 
total fluctuation among them of 6.5°? 

In their second experiment they con- 
structed three surfaces each textured with 
round, white 6-millimeter dots. The 
separation of the dots measured from 
center to center for the three surfaces was 
6, 30, and 90 millimeters. А 6-millimeter 
dot separated by 6-millimeters means that 
the dots had to be adjacent to each other. 
leaving an interspace where the dots met 
of approximately 1.5 millimeters in height 
and shaped like an equilateral triangle. 
At the viewing distance of 1,380 milli- 
meters with a field of view of 15.9° and 
a single surface slant of 43°, the angular 
length of that interspace at places farthest 
from the eye was about 2 minutes of arc. 
Mean judged slant was 10.4°. Is E 
not possible that, for some Ss over pre- 
sumably the 48 trials of the experiment, 
visual acuity for so small ап interspace 

5 Most of the criticisms leveled against 
Gruber and Clark apply equally to Eriksson. 
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could have sometimes been subliminal? 
For some Ss on some trials perhaps the 
farthest part of the surface, perhaps all 
of the surface, appeared homogeneous 
and untextured. Gruber and Clark do 
not report any data at all about the visual 
acuity of their Ss. They do not report 
illuminance or dot-background contrast. 
Т would predict that judgments of slant 
ought to be poor under their conditions. 
At the other extreme, when dot separa- 
tion was 90 millimeters, there were 20 
dots over the entire display, and mean 
judged slant was 6.7°. For the interim 
condition when dot separation was 30 
millimeters there were 180 elements over 
the display. and judged slant was 13.5*. 
In other words, with an increase in 
density from 20 to 180, judged slant im- 
proved, as my Equation 4 predicts 
(Flock, 1964a, p. 383). When dot ele- 
ments were made so dense that for some 
Ss parts of the surface might have ap- 
peared untextured, perceived slant de- 
creased. One hardly need postulate a 
theory of cortical or retinal interaction to 
explain that result. Moreover, if their 
interaction hypothesis depends on the ex- 
treme conditions of their second experi- 
ment, its relevance to optical variables of 


texture is debatable. 


LIMITATIONS OF CONTOUR CONVERGENCE 


Freeman's reluctance to accept any 


other variable than the contour con- 


vergence of slanted rectangles as а 
stimulus for perceived slant is hard to 
reconcile with the facts of man's natural 
environment. In nature, apart from 
civilized structures, there seem to be few 
rectangles and rectangular-like elements. 
Nevertheles animals and men (in the 
jungle. for example) make responses that 
are neatly attuned to changes in the 
slants of things, just as if they were 
correctly perceiving slants. Correspond- 
ingly, how does one use linear per- 
spective to explain the following unpub- 
lished experiment by Flock and Graves? 

An extended surface was constructed 
with 518 different-sized, different-shaped 


distributed randomly over it. 
were normally 


mean height 
at the 


triangles 
All surface parameters 
distributed, and the angular 
and g of the distribution of shapes 
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center of the display when the surface was 
at 0° slant were 4.5° and 2.33°, = 
tively. The surface was placed at six ues 
ferent slants and was seen throug! 
apertures of 20°, 40°, and 80°. edges 
sion coefficients for the three pcs 
view with 4.3, 19, and 126 shapes visible 
at 30? slant were .10, .30, and -69, re- 
spectively. A second group of Ss judged 
the slant of a small (14.03°), medium 
(32.25°), and large ‚(66.25 ) triangle 
under the same conditions and produced 
coefficients of .09, .03, and —.04, Tespec- 
tively. Thus, contours of triangles in- 
duced no slant regardless of their size, 
whereas textures composed of small tri- 
angles induced slant judgments that be- 
came progressively better as field of view 
was increased. Can Freeman explain 
these results in terms of contour conver- 
gence? Tt is doubtful that he can explain 
any slant data in which a rectangle is 
not presented. His unqualified claim 
that contour convergence is the stimulus 
for perceived slant is both oversimplified, 
because it does not handle all of the avail- 
able data, and premature, because it is 


even less capable of coping with the kind 
of data just presented, 


THE SPECIAL Case or THE RECTANGLE 
Nowhere have I as 
Optics of surface texture constitute the 
stimuli for slant or are the sole deter- 
miners of its perception. I have never 
made any claim for variables of optical 
texture other than to show that they 
could explain some old problems in per- 
ception, For example, given variables 
of optical texture, it is not necessary to 
introduce into Space perception many of 
the subjectivist assumptions with which 
we have been burdened (see also Flock, 
1964c). The theoretical model I have 
proposed makes it possible to give an 
account of how perspective cues like 
linear perspective, size Perspective, and 
motion parallax are related. Freeman’s 
(1965) paper does raise the question, 
however, of how and whether the iso- 
lated slanted rectangle is related to my 
theoretical formulation. The answer is 
that the rectangle can be considered an 
extreme and special case of the formula- 


serted that the 
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tion There are many aspects of рег. 
spective, however, that the theoretica 
model will not handle. That fact does not 
mean that either the theoretical model 
or some event it will not handle is 
bad, as is implied by Clark et al. (1956b) 
and Smith (1956) whose taxonomy = 
to the elliptical projection of a slanted 
circle as a “distortion.” What it does 
mean is that the theoretical model has 
limitations, . 

In the theoretical model the idea of 
redundancy of optical information was 
developed. It was shown that there 
would be redundant classes of congruent 
luminous patterns, any one of which 
would theoretically specify a distal sur- 
face slant. These redundant sources of 
information might refer to a distal rec- 
tangle, to the surface texture of that 
rectangle, or to a Specific structural as- 
pect of that surface texture. What makes 
a particular source of information an ef- 
fective rather than a potential stimulus 
for a visual system could depend on a 
variety of factors, for example, viewing 
conditions, visual resolving powers, a 
variety of physiological conditions, in- 
structions, attentional factors, an endless 
variety of experiental factors, etc. (see 
Flock, 1963, 1964a; Flock & Moscatelli, 
1964; and for the distinction between po- 
tential and effective stimuli, see Gibson, 
1960). For these reasons, pitting the 
force of one class of perspective informa- 
tion against the force of a second, as is 
favored by many Es cited above, might 

в With 
center of 
normal m 
tween the 


the optic fixation point at the 
the projected trapezoid, with the 
eridian angularly equidistant be- 
parallel trapezoidal bases and ap- 
proximately projecting the true axis of ro- 
tation, with the great circle arc through the 
optic fixation point and perpendicular to the 
normal meridian, it will then be true along 
the normal meridian and all meridians paral- 
lel to it that (a) angular extents of the pro- 
jected trapezoid will be bilaterally sym- 
metrical on either side of the great circle 
arc, (b) there will be a gradient of those 
angular extents along the great circle arc, 
and (c) therefore Criteria I-IV will be met, 
although the conditions are special. The 
slant of the rectangle is then given as speci- 
fied by the derivation of optical theta (see 
Flock, 1964a, pp. 384-388). 
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yield surprisingly little useful information 
about the general effectiveness of a par- 
ticular stimulus for space perceptions. 


PHYSIOLOGICAL EVIDENCE 


Freeman finds in several physiological 
studies support for contour convergence 
as the "principal stimulus" for slant. The 
reasoning (1965, p. 503) is that the “vis- 
ual system is 'tuned' to register contours 
(abrupt brightness gradients)." Freeman 
seems to equate linear perspective with all 
contours and with all brightness gradi- 
ents. In fact, my lengthy discussion of 
congruent luminous patterns depends on 
the assumption that visual systems are 
tuned to register contours and abrupt 
brightness gradients (Flock, 1964a, pp. 
380-382). How else could the optical 
texture be registered by the organism? 


Frock's EQUATION 4 


Freeman (1965, Footnote 2) attributes 
to this equation peculiar properties, of 
which negative values is one. But the 
equation cannot take negative values. 
The equation depends on the assumption 
that the parameters of a texture are, if 
not normally, at least rectangularly dis- 
tributed (Flock, 1964a, p. 382). Under 
those conditions it cannot have negative 
values, Even without such restriction, 
the conditions under which it could take 
negative values are very extreme. 

Equation 4 should not be considered 
more than a first statement of the relation 
between size of visual field, size of ele- 
ment, and variability of elements, on the 
one hand, and stochastic regularity from 
the point of view of optics, on the other. 
Its function at present is a strategic one. 
Perhaps for the first time three extremely 
important optical variables have been 
brought together into a rational relation- 
ship. Until the effects of these variables 
have been empirically tested, their pre- 
cise relationship must remain unknown. 


Frock's EQUATION 5 

to Freeman (1965, p. 502) 
cal theory of slant encounters 
ouble" when it relates the 
{ the surface to ratios of 
han to the surface’s “size, 
her variables, as well as 


According 
“Flock’s opti 
its greatest tr 
optical slant o 
angles, rather tl 
distance, and ot 
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slant.” He then cites some commonplaces 
about perspective and implies that they 
make invalid my Equation 5. He is in- 
correct. Equation 5 is invariant over the 
changes in conditions which he describes. 
Freeman is simply wrong when he at- 
tempts to discuss the mathematics of per- 
spective.? Finally, his complaint that 
the derivation of Equation 5 was not 
available is unwarranted (Flock, 1964a, 


Footnote 6). 


Sensory ABILITIES 


Freeman raises questions about the 
sensory abilities presumed to be neces- 
sary for the registration of the optical 
variables that specify the slant of a sur- 
face. Most of the presumed abilities are 
not very extraordinary. I do want to be 
explicit about one visual task, however, 
that is crucial to the assumption that these 
potential optical variables might be per- 
ceptually effective. In the section on 
“Magnification” I discussed the luminous 
structure of the optical texture. I sug- 
gested that an optical instrument like the 
eye could scan a small region of the opti- 
cal texture in some angularly uniform 
step, register changes in absolute (but 
averaged) luminances for successive 
steps, adjust the angular magnitude of 
the step in scanning a different region of 
the optical texture, and thereby identify 
congruent luminous patterns over the 
optical texture. Whether an eye could 
make this kind of optical analysis is, of 
course, an empirical question. Pursuing 
that empirical question is only one of the 

т Consider Freeman’s (in press) use of 
mathematics in his own work. That his 
Equation 2 is false should be self-evident. 

d have reasonable 


For example, letting a and 
— 0, tan à should ap- 


values and letting с р 
proach 0. His tan 8 does not have this 
property. His statement that т is un- 


changed by the height of the rectangle is 
self-evident but has absolutely nothing to 
do with the stimulus at the eye or with his 
argument. The angular size of 7 at the eye 
very definitely changes with the height of the 
rectangle. He then compares 7 with 3, the 
former being a base angle on a projection 
plane, the latter being the angular differ- 
ence of a different event at the eye. Fo 
a first-rate analysis of the complexities © 


perspective, see Gibson (1957)- 
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possible strategies, however. I have re- 
cently been fascinated by the many-sided 
hypotheses that can be generated from 
the theoretical model and by the possibil- 
ity of testing such hypotheses. For ex- 
ample, applying Jameson and Hurvich's 
(1964) achromatic color theory to the 
visual problem of identifying congruent 
luminous patterns in the optical texture, 
it seems to me that it might be possible 
to predict systematic errors in slant and 
distance perception of extended surfaces 
as a function of changing illuminance and 
luminance levels. Hypotheses that have 
been tested and are in the process of 
being tested have been discussed else- 
where (Flock, 1964a). The data from 
these tests have not been published as yet, 
however. 


CoNCLUSION 


Freeman calls this theoretical model a 
theory of the stimulus and calls his own 
a theory of perception. I do not know 
what is gained by that distinction. Pre- 
sumably, theories that account for and 


predict empirical results will flourish, the 
others will not. 
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